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Introduction 


Elements of Statistics: It has been written to fulfil the needs of Ist Sem. B.C.A. Students. 
It is needless to say that the Matrix, Sets, Arithmetics. Detailed chapters of Differentiation 
and Integration, A.P., G.P. and H.P. gives the better outline to acquire Mathematical 


knowledge. 


Features 


Accessibility: Each chapter of the book has been written in a simple & lucid style so that it 


is easily accessible. The basic idea of each topic is carefully and clearly explained. 


Prerequisites: Prerequisite topics of each chapter are listed in the contents of the book 
and a basic understanding of those topics will suffice for the study of that particular 
chapter. 


Exercise Sets: Each chapter includes a comprehensive exercise set containing routine, 
intermediate and challenging exercises so that the learners may have sufficient practice in 


the subject. Solutions to all exercises are given at the end of each chapter. 


Goals 


The goal of this book is to present a wide variety of interesting applications of B.C.A. 
Statistics in a flexible and accessible format. The book entitled Elements of Statistics for 
B.C.A. has been written to meet requirements of B.C.A. course of all Indian Universities 
on this subject. It covers topics from Sets, Percentage, A.P., G.P., H.P., Differentiation and 
Integration, Matrices. This book contains illustrative examples, theorems with proofs and 
comprehensive exercises. Throughout the book, the subject matter has been dealt in such a 
simple, logical and systematic way that students will not find any difficulty to understand 
it. 


(iv) 


(P REFACE 


n 
OB egg 


I t gives me pleasure to place the book ‘Elements of Statistics for B.C.A.’. (All 
Universities in U.P.) which has been completely revised and made upto date in the 

light of suggestions received from the students and the learned teachers of various 

universities. 

The following are the main features of third edition: 

> More than 150 new problems and examples have been added. 

> Includes recent elegant and elementary proof. 


> All chapters are arranged according to the latest U.P. Unified Syllabus implemented 
from 2012 - 2013, keeping in view the requirement of the students of B.C.A. IIrd 
semester of all Universities in Uttar Pradesh (U.P). 


> Errors and omissions have been corrected. 


> Up-to-date yearwise references from various universities examination papers have 
been given throughout the book. 


> Each step is made clear by giving reasons with clarification. 


We hope that the book in its present form will prove beneficial to the students. We shall 
also forward to receive your suggestions and comments bout the book so that it could be 


further improved. 


a J.P. Chauhan 
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Elements of Statistics f STATISTICS 


U.P. UNIFIED (w.e.f. 2012-2013) 
B.C.A. S. 205 B.C.A. IId Semester 3104 


Unit-I 
Population, Sample and Data Condensation: Definition and scope of statistics, 
concept of population and simple with Illustration, Raw data, attributes and variables, 


classification, frequency distribution, Cumulative frequency distribution. 
Unit-II 
Measures of Central Tendeny: Concept of central Tendency, requirements of a good 


measures of central tendency, Arithmetic mean, Median, Mode, Harmonic Mean, 


Geometric mean for grouped and ungrouped data. 
Unit-I 


Measures of Dispersion: Concept of dispersion, Absolute and relative measure of 
dispersion, range variance, Standard deviation, Coefficient of variation. 

Unit-IV 
Permutations and Combinations: Permutations of 'n' dissimilar objects taken 'r' at a 
time (with or without repetitions). "P'=n!/(n-r) !(without proof). Combinations of 'r' 
objects taken from 'n' objects. "C’=n!/(r!(n-r)!) (without proof) . Simple examples, 
Applications. 

Unit-V 


Sample Space, Events and Probability: Experiments and random experiments, 
Ideas of deterministic and non-deterministic experiments; Definition of sample space, 
discrete sample space, events; Types of events, Union and intersections of two or more 
events, mutually exclusive events, Complementary event, Exhaustive event; Simple 
examples. 

Classical definition of probability, Addition theorem of probability without Proof (upto 
three events are expected). Definition of conditional probability Definition of 


independence of two events, simple numerical problems. 
Unit-VI 
Statistical Quality Control: Introduction, control limits, specification limits, 


tolerance limits, process and product control; Control charts for X and R; Control charts 


for number of defective {n-p chart} control charts for number of defects {c - chart}. 
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Chapter 


Statistics Concept, 
Significance and Data 


1.1 Definition of Statistics 


[B.C.A. (I.G.N.O.U.) 2011, 2007; B.C.A. (Lucknow) 2008, 2006; 
B.C.A. (Rohilkhand) 2008, 2004; B.C.A. (Agra) 2007; B.C.A. (Meerut) 2000] 


There have been many definitions of the term ‘Statistics’ which can be broadly classified 
in two senses: 


l. Statistics in Plural Sense 


2.: Statistics in Singular Sense 


1.1.1 Statistics in Plural Sense 

As a plural noun statistics is used for denoting numerical and quantitative information. 
Thus, in plural sense, it means the same thing as data. For example, statistics of scores in 
a cricket match, trade related statistics, labour statistics, census statistics, godown 
statistics and so on. 


All the quantitative observations are not statistics but those quantitative observations 
which satisfy certain conditions are called ‘Statistics’. Also, economist Horace Secrist, 
defined ‘Statistics’ as: 


“By Statistics we mean aggregates of facts affected to a marked extent by multiplicity of causes, 
numerically expressed, enumerated or estimated according to reasonable standards of accuracy, 


collected in a systematic manner for a predetermined purpose and placed in a relation to each other”. 


= Elements of Statistics 


According to this definition of statistics, the quantitative expression should satisfy the 


following conditions: 


(i) 


(ii) 


(iii) 


(iv) 


(v) 


(vi) 


(vii) 


Statistics are a Collection of Facts: Single or isolated numerical expressions are 
not statistics as they can’t be compared and put in relation. For example, Rohan 
scored 127 runs in a match, is not statistics but 10 batsmen scored 32, 57, 10, 0, 
98, 15, 2, 0, 66 and 7 runs respectively are statistics. Hence, a set or a collection of 
observation forms statistics. 


Statistics are Collected in a Suitable Order: Statistics should be collected with a 
suitable plan of data collection. Statistics collected without any system or order, 
will result in failed decisions. 


Statistics are Collected for a Predefined Purpose: The statistics should be 
collected for a specific purpose. For example, while collecting statistics of trading, 
we must decide beforehand the regions, commodities and the periods for which 
they are needed. 


Statistics are Expressed Numerically: Quantitative expression are not statistics 
unless are expressed in numerical forms. Statements like ‘India’s population is 
increasing’, ‘prices of petrol will go high’, ‘Ram is good at Economics’ are not 
statistical statements but statements like “There is 63% increase in food items’, 
‘Average poverty ratios in India is 26’ are statistical statements. 


Statistics are Affected to a Marked Extent by Multiplicity of Causes: 
Statistics or data are affected by a number of forces operating together i.e., they are 
dependent on a number of phenomena. For example, statistics of production of an 
agricultural commodity are affected by rainfall, soil quality, manuring process and 
method of cultivation etc. Similarly, it same for other sectors also. 


Statistics are Placed in Relation to Each Other: Statistics are collected for 
making comparisons between related phenomena over a specific region or period. 
For example, the population figures of India, UK, USA, China etc. in 2011 census 
are comparable as they belong to same census year. Thus, for a valid comparison, 
statistics are required to be put in relation to each other. 


Statistics are Enumerated or Estimated According to Reasonable Standards 
of Accuracy: Enumeration means a precise and accurate numerical statement. 
These statements can be obtained by two ways either by actual counting or 
measurement or by estimate. For example, the number of students in class can be 
counted with perfect accuracy but in situation, such as in a rally, one has to 
estimate the number of people. Similarly, exact measurements on heights, weights, 
distances etc. are not possible and one has to make estimates with reasonable 
standards of accuracy. 
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1.1.2 Statistics in Singular Sense 


Whenever a huge amount of data is collected, there arise a need to organize, present, 
analyse and interpret them in a systematic order. Statistics, in a singular sense is a branch 
of science which deals with scientific methods of collection, organization, presentation 
analysis and interpretation. According to Croxton and Cowden, “Statistics may be 
defined as a science of collection, presentation analysis, and interpretation of numerical 
data”. 


Thus, we have the following five stages in a statistical investigation: 


(i) Collection of Data: With a specific and well defined purpose is the first step in a 
statistical investigation. If the collected data are faulty, the conclusions drawn are 
bound to be fallacious. Hence, data should be collected with a maximum core as 
they form the foundation of statistical analysis. 


(ii) Presentation of Data: Collected and organised data are presented in a systematic 
order to facilitate statistical analysis. The organised data are presented in the form 
of tables, graphs and diagrams to allow better understanding and comparison. 


(iii) Organisation of Data: is also an important step in statistics as deriving any 
conclusions from the observations only is difficult. So, its purpose is to provide a 
methodology for classification and description in a summary form. Editing, 
classification and tabulation are three step in organisation of data. 


(iv) Analysis of Data: Statistics plays two important roles in analysis of data: 
(i) Descriptive statistics and (ii) Inferential statistics. 


Difference between these two are: 


Descriptive Statistics Inferential Statistics 


It is concerned with describing and |It is concerned with certain 
summarising the numerical properties of | derivation of scientific inference 


data. Its methodology includes | about generalisation of results from 


classification, tabulation, graphical | the study of a few particular cases. 

presentation and calculation of certain | Technically speaking, the methods of 

indicators of data such as mean range, | inferential statistics helps in 

standard deviation etc. which summarise | generalising the results of a sample to 

certain important features of data the entire population from which the 
sample is drawn. 


The nature of inference is that we make general statements from the study of a few 
cases. In such case, generalisation can’t be made with certainty. Thus, language of 
probability is often used in statistics conclusions. 
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1.1.3 Distinction between the Two Senses of Definitions 


Statistics as Data Statistics as Method 


It is plural. . Itis singular. 


It refers to a sense of data. . It refers to any statistical method. 


It may be of primary or secondary | 3. It is of scientific nature. 


nature. 


It is in the form of raw materials. . It is in the form of tools applied for 
processing the materials. 


1.2 Scope of Statistics 


[B.C.A. (I.G.N.O.U.) 2010; B.C.A. (Lucknow) 2010, 2008, 2004; 
B.C.A. (Rohtak) 2009; B.C.A. (Meerut) 2006, 2004] 


Scope of statistics is indeed very fast. It covers all fields of study where in quantitative 
appear and need analysis. It is essential from the state of planning to the stage of 
conclusions. It applies to all sciences and applications, such as: agricultural, engineering, 
medical, physical, economics, business, commerce, industry etc. 


1.2.1 Statistics and State 


As per reports of collection of data statistics was developed as a state craft for 
maintaining economic and military strength and for formulating different policies. 
Gradually, the sphere of data collection to other fields such as sales, health, taxes, import, 
export, production etc. Now, the functions of a state have increased due to increase in the 
concept of welfare state. For e.g. Population. 


1.2.2 Statistics and Economics 
[B.C.A. (A.M.U.) 2008; B.C.A. (Kashi Vidhyapeeth) 2007] 


Statistics as ‘data’ and ‘methodology’ have proved immensely useful in solving a variety 
of economic problems related to wages, prices, analysis of time series, income and 
demand analysis. The development of various economic theories own greatly to 
statistical methods. e.g., “Engel’s law of family expenditure’, ‘Mathusian theory of 
population’. The impact of mathematics and statistics has led to the development of new 
disciplines like ‘Econometrics’ and ‘Economic statistics’. 
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1.2.3 Statistics and Economic Planning 
[B.C.A. (Kashi Vidhyapeeth) 2011; B.C.A. (A.M.U.) 2011, 2009; 
B.C.A. (Lucknow) 2010, 2006] 


Statistics is the most important tool in economic planning which is essential for proper 
utilisation of natural resources. Drawing plan, execution and review is based on 
statistical analysis. Priorities of expenditure of a national budget can be determined 
through the proper understanding of various economic problems, viz., balance of trade, 
industrial growth, disparities in income and wealth, agricultural production etc. 
Time-bound goals are set after listening the priorities of growth in different economic 
activities. Maximum utilisation of available resources can be made by using econometric 


models based on statistical analysis. 


1.2.4 Statistics and Business 
[B.C.A. (A.M.U.) 2011; B.C.A. (Avadh) 2008, 2005] 


With growing size and increasing competition, the problems of business enterprises have 
become complex. Statistics is now considered as an indispensable tool in the analysis of 
activities in the field of business, commerce and industry. In business, the manufacturer 
is always interested to estimated the immediate and future demand of his product. The 
object can be achieved by properly conducted market survey and research which greatly 
depend on statistical methods. The trends in sales and production can be determined by 
statistical methods like time-series analysis which are essential for future planning of the 
phenomena. Similarly, the correlation and regression analysis can be used to judge the 


effectiveness of various sales promotion measures. 


Statistical concepts and methods are also used in controlling the quality of products to 
the satisfaction of consumer and the producer. Thus, method of quality control and 


reliability analysis are invaluable in industrial engineering. 


1.2.5 Statistics and Psychology and Education 
[B.C.A. (Rohilkhand) 2008, 2006] 


Statistics has found wide application in psychology and education. It forms the basic of 
scientific approach to problem in which inductive inference is commonly used. Students 
of psychology and education cannot remain ignorant of such methods of problem solving 
in their disciplines. Also, statistical methods are used to measure human ability (for e.g., 
intelligence, aptitude, personality, interest etc.) by tests. Theory of learning is also based 
on statistical principles. Application of statistics in psychology and education have led to 


the development of new discipline called “Psychometry’. 
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1.2.6 Statistics and Medical, Biological and Agricultural Sciences 


Statistics is also one of the most prominent tool in the field of Medicines, Biology and 
Agriculture. In Medicine sciences, one is greatly concerned with the results obtained 
from the use of medicines and drugs. Similarly, the important role of various factors in 
the growth and development of a plant is important in Biology. In Agriculture, the role of 
factors viz., manure, rainfall, watering process, seed quality, affecting the production 
need to be analyzed for good results statistical techniques like ‘analysis of variance’ and 
‘Design of experiments’ are useful for isolating the role of such factors. 


1.3 Limitation of Statistics 
[B.C.A. (Kurukshetra) 2011; B.C.A. (Purvanchal) 2009, 2007; 
B.C.A. (I.G.N.O.U.) 2007; B.C.A. (Kanpur) 2007, 2005; B.C.A. (Meerut) 2004, 2002] 


Though, statistics has many application in various important areas but it also has some 
limitations. Hence, for proper use of statistics, it is also necessary to know the limitations 
and misuse of statistics. The following are the limitations of statistics: 


l. Statistics can be Misused and Misinterpreted: The greatest limitation of 
statistics is that it is likely to be misused. Inadequate and faulty procedure of data 
collection and inappropriate comparisons may arrive at fallacious conclusions. 


For e.g., when conclusions are based on incomplete information and are drawn by 
unskilled investigators, then it is said to be the misuse of statistics. 


2; Statistics Deals Only with Quantitative Data: If the study yields qualitative 
data which cannot be meaningfully converted to quantitative data, then valid 
conclusions cannot be drawn using statistical analysis. 


3. Statistics Deal with Aggregates of Items and not with Individual: Statistics is 
the study of mass of data and deals with collective information. A single data or 
individually constituted data is not considered as a statistical analysis. For e.g. 
Akash scored 81 marks out of 100 in Economics, is not a statistical data, whereas, 
Akash’s average marks are 92 is statistical data. 


4. Statistical Laws are True Only on An Average: Laws of statistics are not 
universally applicable as the laws of physics, chemistry and mathematics. These 
may not be true for particular individual. If it is statistically established that 
annual income of a man is 3 lakhs to 5 lakhs then this statement is true only on an 
average and not with individual. 


5. Statistics is Only One of the Methods of Studying a Phenomenon: Statistical 
methods don’t always provide the best possible solution to a given problem. In 
varying culture and religious situation, it may fail to reveal the underlying facts 
responsible for the variations in a phenomenon under study. Thus, statistical 
conclusions are to be supplemented by other variations. 
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1.4 Distrust and Misuse of Statistics 


[B.C.A. (B.H.U.) 2011, 2008] 


We have observed that statistical techniques are important tools in analysing problems 


in different areas of life. But, still there is a considerable distrust in statistics i.e., lack of 


faith and confidence in statistical methods and statements. It is often said that: 


l. 


2 
3. 
4 


Statistics is another form of lying. 
There are three kinds of lying — lies, damned lies and statistics. 
Statistics can prove or disprove anything. 


Statistics reveal what is convincing but conceals the truth. 


1.4.1 Reasons for Distrust in Statistics 


[B.C.A. (Kanpur) 2008, 2004] 


Figures are innocent and convincing due to which unawared people believe them 
easily. 


Statistical data used for providing arguments may be inaccurate and incomplete 
and will lead to fallacious conclusions. 


Even if the figures are correct, they may be presented or interpreted in such a 
manner that the concerned persons get misleaded. For e.g. 2 companies X and Y are 
paying = 30,000 and Ẹ 45,000 per month respectively. Now, by this statement 
company Y seems more interesting. But, it would be misleading until we know the 
number of employees and the number of working hours etc. Thus, in such cases, we 
might draw different conclusions, Hence in the view of this statistics, it should be 
used with utmost carefulness. Thus, it is commonly said that ‘statistics neither 
proves anything nor disproves anything’. It is only a tool, which will be good if 
used in proper manner otherwise it may lead to bad decisions. 


The fault doesn’t lie in statistics but this distrust arise because of its misuse on the 
part of untrained practitioners of statistical methods, ignorant users of statistical 
results and the dishonest people who use statistics to mislead and confuse rather 
than to clarify and illuminate. 


“Science of statistics is the most useful servant but only of great value to those, 
who understand its proper use”. 


For e.g., Hockey is a nice game for physical exercise but wrong conduct of the game 
and wrong rules & regulations may make it a dangerous game. But, game of 
Hockey can’t be blame for a wrong outcome. Similarly, if the statistical facts are 
mis-interpreted by the people to serve their ends, it will be unjust to blame the 
science as such. Instead, people are responsible for these faults. 
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1.5 Nature of Statistics (Science or Art) 


[B.C.A. (Bundelkhand) 2008; B.C.A. (Kanpur) 2007; B.C.A. (Bundelkhand) 2006] 


1.5.1 Statistics as Science 


Science refers to a systematised body of knowledge. It studies cause and effect relationship 
and attempts to make generalisation in the form of scientific principles of laws. Like other 
sciences, statistical methods are also used to answer the question like, how an investigation 
should be conducted ? In what way the valid and reliable conclusions can be drawn ? How 
far are these conclusions dependable ? Hence, statistics is a science. The science of 
statistics is different from physics, chemistry etc. Statistics is called the science of scientific 
methods. That is why some statisticians do not consider it a pure science but a scientific 
method. According to Cowden and Croxton, “Statistics is not a science, it is a scientific 
method”. In the words of Tippet, “As science, the statistical method is a part of the 


general scientific method and is based on the same fundamental ideas and processes”. 


1.5.2 Statistic as an Art 


An art is an applied knowledge . It refers to the skill of handling facts so to achieve a given 
objective. It is concerned with the ways and means of presenting and handling data, 
making inferences logically and drawing relevant conclusions. Statistics is an art which 
tells us how to use statistical rules and principles to study the problems and finding their 
solutions. By the application of statistical method a number of complex problems can be 
understood easily. Collection of statistical (data), it use and utility are itself an art. How 
the desired results are obtained by using the rules of statistics, which statistical methods 
are used to solve special type of problems ? How the two facts are compared and by 
establishing correlation conclusions are drawn ? All these aspects actually relate to art 


and this is the reason why statistics is an art. 


1.5.3 Statistics is both Science and Art 


After studying, the science and art aspect of statistics it is said that statistics is both 
science as well as Art. Statistics is used not only to gain knowledge but also to understand 
facts and draw important conclusion from it. This type of conclusions prepare a path of 
the economic and social progress. According to Tippet, “Statistics is both a science and 
an art. It is a science in that its methods are basically systematic and have general 
application and an art in that their successful application depends to a considerable 
degree on the skill and special experience of the statistician and his knowledge of the field 


of application, e.g.—Economics”’. 
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1.6 Functions of Statistics 
[B.C.A. (Rohtak) 2010; B.C.A. (Lucknow) 2007] 


It Simplifies Complex Data: Statistical methods like averages, totals, 
percentages, etc. help in condensing mass of data into a few significant figures so as 
to make them easily understandable. 


It Presents the Facts in a Definite Form: Statistics deals with the quantitative 
statements of facts and thus presents them in a precise and definite form. 
Quantitative statements of facts are always more convincing than vague statements. 
For example, the statements like ‘the birth rate in India is decreasing’, ‘the prices of 
commodities are rising’, ‘Mr. Yogesh has passed the high school examination’, etc. 
do not convey information in a definite form as they do not involve quantitative 
statements of facts. 


It Provides a Technique of Comparison: Comparison of quantitative facts is 
also an important function of statistics. For example, area-wise and period-wise 
comparison of data related to production, sales, export, import, population, etc. 
are helpful for drawing valid conclusions about economic activities. 


It Studies Relationship: Correlation analysis is used to find the functional 
relationship between two or more variables. The relationship is very helpful in 
making estimates and forecasting future trends. For example, the relationship 
between supply and demand, advertisement and sales, etc. provide us the basic tool 
for planning these activities. 


It Helps in Formulating Policies: As discussed earlier, inferential statistical 
methods are helpful in formulating various economic and business policies. 
Empirical studies of data concerned with imports, exports, wages, production, etc. 
are helpful in framing suitable policies and plans in the respective areas. Some well 
established law such as ‘Malthusian theory of population’ and ‘Engel’ law of family 
expenditure’ are based on statistical analysis of relevant data. 


It Helps in Forecasting: Statistics is now an indispensable tool in the analysis of 
activities relating to business, commerce and industry. Such an analysis is useful 
for determining trends in activities related to these areas, which in turn form the 
basis of estimation and forecasting about the phenomenon under study. 


It Helps to Test and Formulate Theories: The function of statistics is also to test 
and formulate policies in different fields and to measure their effects. For example, 
statistical methods are used in agricultural and biological science in determining 
the role of various factors of growth and development of the organism or plant 
under study. In medical sciences, the effects of drugs are recorded and analysed. In 
all such studies, effects of different factors can be tested by collecting and 
comparing the relevant data. Properly conducted consumers and market surveys 
are indispensable tools in formulating precise and definite production policies. 


l. 
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1.7 Statistical Units 


In Practical Field of Survey the definition of location of statistical unit remains a 


pre-requisite for the collection, interpretation and presentation of the numerical 


data. A unit stands for the qualitative or quantitative measure in terms of which 


the data are collected and analysed. 


(i) 


(ii) 


(iii) 


(i) 


(ii) 


(iii) 
(iv) 


Statistical Units can be Divided into the Following Three Categories: 


Unit of enumeration is that in terms of which objects are measured and 
collected. Such units may be simple or composite. A simple unit denotes only 
one attribute or consideration without any qualifying limit attached to it. 
Suppose at a particular hours and place 4000 passengers travelled over 
N-railway and 600 over S-railway. Then the comparison of popularity can not 
be made without knowing the distance travelled in both the cases. If it is 
found the N-railway passengers covered a distance 200 kms and those of 
S-railway 125 kms, then to have a correct comparison of the popularity of two 
trains, the total number of passengers-kilometers in both the cases are 
obtained, which is 


200 x 4000 = 800,000 for N-railway 
and 125 x 6000 = 750,00 for S-railway 


indicate that former one is more popular. The other examples of composite 
unit, naan-hour, killo-watt-hour, rupee-gram etc. 


Unit of analysis and interpretation is that in terms of which data are 
analysed, compared and interpreted rates, percentages are used for this 
purpose. 


Unit of presentation is that in terms of which data are presented. However, 
ultimately the nature of such units will belong to either of the first two 
categories only. Because data are presented with respect to time place and 
condition either after measurement and collection or after its statistical 
analysis and interpretation. 


Characteristics of a Good Statistical Unit 


Unit should be clear and unambiguous otherwise different people will give 
different interpretations. For example, the unit of household has to be 
defined specifically whether it will stand for a family related by blood, or for 
those living together collectively or having common kitchen. 


Unit should be self-explanatory. The persons for whom survey or the 
investigation has been conducted should be able to understand and interpret 
the unit easily. 


Unit should be rigid and definite without any flexibility in nature. 


Unit should be relevant. For a unit relevant and suitability to the purpose 
of enquiry is essential. For example, natural income should be measured in 
national coins. Similarly, for the study of giving conditions of the people, 
house hold is a suitable unit. 
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1.8 Collection of Data 


Collection of data is the first step in any statistical investigation of a phenomenon. We know that 


building cannot be constructed without the right bricks, similarly we cannot come to any conclusion if 


the collection of data is not proper. 


Data are generally classified into the following two groups: 


l. 
l. 


Internal Data 2. External Data 


Internal Data: Internal data comes from internal sources related with the functioning of 
an organization or firm where records regarding purchase, production, sales, profits 
etc. are kept on a regular basis. Various Government departments like Railways, 
Communications and Education etc., also generate internal data which are useful 
for their proper internal functioning. However, the internal data can be either 
insufficient or inappropriate for the statistical enquiry into a phenomenon. In that 
situation we need external data. 


External Data: External data are collected and published by external agencies. This type 
of data can be obtained from primary source or secondary source. Thus, the 
external data can further be classified as: Primary and Secondary data. 


(G) Primary data are original and first hand information. Data are termed primary 


when the reference is to data collected for the first time by the investigator. For 
example, the Meteorological department regularly collects data on various 
aspects of the weather and climate such as amount of rainfall, humidity, 
minimum and maximum temperature of a certain place. These constitute 
primary data. Similarly, the data in a population census obtained by the office of 
the Registrar General and Census Commissioner are primary in nature. 


(ii) Secondary data on the other hand, data are termed secondary when 
collected from records or data already available. In other words, the 
secondary data are one which has already been collected by a source other than the 
present investigator. For example, population census data are primary for the 
office of the Registrar General and Census Commissioner whereas for other 
organizations or individuals who use such data, they are secondary. Thus, 
data which are primary in one hand become secondary in the hands of others. 


1.8.1 Difference between Primary and Secondary Data 


[B.C.A. (Bhopal) 2010, 2006; B.C.A. (Meerut) 2002] 


The primary data are absolutely new and are collected a fresh whereas the 
secondary data are those which were previously collected and published by some 
previous investigators and are being simply used by the present investigators. 


The primary data are collected according to the plan of the investigator and so it 
requires much money, labour and time in collecting them, whereas the secondary 
data is always readily available and can be borrowed easily by the present 
investigator for his purpose. 
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Gii) 


The primary data are collected according to the requirement of the problem under 
consideration while, the secondary data was collected for another purpose and is 
moulded according to the present requirement. 


1.8.2 Methods of Collecting Primary Data 


Primary data are collected for the first time through census or sample surveys. The 


following are the methods of collecting primary data: 


(i) 

(ii) 
(iii) 
( 
( 


iv) 


(i) 


Direct Personal Interviews or Observations 
Indirect Personal Interviews or Observations 
Mailed Questionnaires 

Schedules through Enumerators 


Information through Local Sources or Correspondents 


Direct Personal Interviews or Observations: In this method the investigator 
contacts the source of information directly and personally. This method of 
collecting data is suitable in the following situations: 


(a) When the area of enquiry is limited. 
(b) When high degree of accuracy is needed in the results. 


(c) When the investigation is difficult and is to be conducted in a heterogeneous 


area. 


(d) When the information collected is of confidential nature. 


Merits [B.C.A. (Agra) 2003] 


(a) Information collected by this method is accurate and reliable. It can be 


checked and rechecked and put in the desired form. 
(b) Alternative supplementary information can be gathered simultaneously. 
(c) Collected information can be made free from bias and prejudices. 
(d) This is the most suitable method when the area of investigation is limited. 
Demerits [B.C.A. (Agra) 2003] 


(a) Such a method can be adopted only when the enquiry is intensive and 


localized. It cannot be adopted when the field of enquiry is wide. 
(b) The involvement of personal bias and prejudices is obvious. 


(c) Investigators should be trained and efficient persons, otherwise the 


information collected may be unreliable. 
(d) Itis not scientific. 


(e) It requires too much time and labour. 
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(ii) 


(iii) 


Indirect Personal Interviews or Observations: In some cases, the informants 
cannot be contacted directly. In this situation an indirect personal enquiry is 
conducted to get the desired information. The indirect personal investigation is 
done through some agencies who have some knowledge of the phenomenon under 
enquiry. For example, an investigator may collect information about cost of 
cultivation indirectly from the village Pradhan (headman) who have intimate 
knowledge of such matters. This method is useful in the following cases : 


(a) When the area of investigation is large. 
(b) When the information cannot be obtained directly from the informants. 


(c) This method is generally used by the government or committees and 
commissions appointed by the government. They collect information about 


different problems by interviewing the person concerned. 
Merits 
(a) It is useful when the area of investigation is wide. 
(b) It saves labour, time and money. 


(c) As the information is collected from knowledgeable persons, it is expected to 


be useful and reliable. 


(d) The enquiry could be more extensive and different aspects of the problems 


can also be studied properly. 
(e) This method is easy. 


(£) It is free from the prejudices of the investigator. 


Demerits 


(a) The information collected may be subjective and is subject to the personal 


bias of the persons or agencies from whom it is collected. 


(b) One has to be very careful about the selection of persons from whom the 
information is collected, because their knowledge and personal attitude 
towards the problem greatly affect the quality of data. So one has to be very 


careful in dealing with such situations. 


Mailed Questionnaire Method: In the mailed questionnaire method, a 
questionnaire in the form of a set of questions is sent by mail to the informants. 
They are expected to answer the questions and also the additional needed 
information, whenever needed and mail them back to the investigator. This 
method should be used in the following cases: 


(a) When the area under investigation is wide. 
(b) When the informants are educated. 


(c) When the informants are expected to leave for far away places. 
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Merits 


(iv) 


(a) 
(b) 


It is economical and less time consuming. 


A larger area can be covered under the investigation as even the people in 
distant places can be reached without much difficulty. 


Collected information may be more reliable. 


(d) Itis free from the bias of the investigator as the information is collected from 
the informants themselves. 

Demerits 

(a) This method can be adopted only in cases when the informants are educated 


(b) 


(c) 


persons. 


All the informants may not co-operate and the information to be collected 
may remain incomplete. Sometimes people may not spare time to fill the 
questionnaire or they may not be willing to return the filled questionnaire. 
Obviously collected information will not be completed. 


The greatest disadvantage of this method is the unwillingness of persons to 
reply at all or in full due to carelessness or fear of exposure. 


Schedule Through Enumerators: Initially let us make a distinction between a 


questionnaire and a schedule. The questionnaire is a set of questions and the 


answers to which are recorded by the informant itself, whereas in a schedule 


answers are recorded by the investigator or an enumerator on his behalf. In this 


method the investigators or enumerators approach the informants with a prepared 


questionnaire and get the replies to the questions. This method is generally used in 


censuses and large scale surveys. 


In the case of census, investigators visit every member of the source of information 


in the zones while, in the case of sample survey, they collect information from 


those members who have been selected in the sample. 


Merits 

(a) This is the only method which is applicable in census and sample survey both. 
(b) Subjectivity on the part of the interviewer is less in this method. 

(c) This method is useful where the scope and coverage is very large. 

(d) Personal contacts increase the reliability of data. 

(e) Information from an illiterate person too can be obtained. 

Demerits 

(a) The method is quite costly and time consuming. 


(b) 
(c) 


Trained and efficient investigators will be needed to get the information. 


The method can be used only by big organizations. 
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1.9 Framing a Questionnaire or a Schedule 


Great care and caution has to be exercised in drafting a questionnaire or a schedule, as 
this is the basis of collecting information in an investigation. Apart from the care and 
caution, a lot of expertise and experience of the phenomenon under investigation are 
required in its preparation. Though there are no hard and fast rules for drafting a 


questionnaire, still following points should be given due consideration. These points are: 


l. Covering Letter: In this letter the investigator should introduce himself and make 
the objectives of the survey clear to the informant. In this, the informant should be 
assured that the information provided by him will be kept secret. A self-addressed 
stamped envelope should also be sent, so that the informant can return the 


questionnaire. 


2. Number of Questions: The number of questions in the questionnaire should be 
as small as possible. This results in time saving and is convenient to both, i.e., to 


the enumerator and the respondent. 


3: Nature of Questions: Delicate questions should be put with great care. Often 


indirect questions should be formed to get the answers to these questions. 


4. Simplicity of Questions: The questions set should be clear, concise, short 
answered and unambiguous. These should be related to the phenomenon under 


investigation. 


Di Arrangement of Questions: There should be a natural and logical order of the 
questions in a questionnaire. For example, it is not logical to ask a man about his 


income before asking him about his occupation. 


6. Information Collected is Usable: It should be noted that the information 


collected through questions is usable. 


7A Avoid Mathematical Questions: As far as possible, questions involving 
mathematical calculations be avoided. It is always better to use multiple choice 
questions (having four or five alternatives) or simple alternative questions (Yes or 


No type). 


8. Attractive Layout of the Questionnaire: The book of the questionnaire should 


be attractive and questions be suitably spaced for proper answering. 
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1.10 Collection of Secondary Data 


[B.C.A. (Meerut) 2008, 2004; B.C.A. (Lucknow) 2007, 2006, 2004] 


Secondary data are those which are collected by some other agency and are used for 


further investigation. The sources of secondary data can be classified into two: 


J; 
2. 


Published Sources 


Unpublished Sources. 


Published Sources 


Some of the published sources providing secondary data are: 


(i) 


(ii) 


(iii) 


(iv) 


(v) 


(a) 


Government Publications: A number of government, semi-government and 
private organizations collect data related to business, trade, prices, 
consumption, production, industries, income, health and population etc. These 
publications are very powerful sources of secondary data. Central Statistical 
Organisation (C.S.O.), National Sample Survey Organisation (N.S.S.O.), 
Office of the Registrar General and Census Commissioner of India, Directorate 
of Economics and Statistics and Labour Bureau, Ministry of Labour are a few 


government publications. 


International Publications: Various governments in the world and 
international agencies regularly publish reports on data collected by them on 
various aspects. For example, U.N.O.’s Statistical Year Book and 


Demography Year Book etc. can be named in this category. 


Semi-official Publications: Local bodies like District Boards, Municipal 
Corporations publish periodicals providing information about vital factors 
like health, births and deaths etc. 


Reports of Committees and Commissions: At times State and Central 
governments appoint committees and commissions with a specific reference to 
study a phenomenon. The reports of these committees and commissions provide 
important secondary data. For example, Kothari Commission report on 
education reforms, report of National Agricultural Commission, Wanchoo 


Commission report on Taxation and Pay Commission reports ete. 


Private Publications: The following private publications may also be 


enlisted as the source of secondary data: 


Journals and Newspapers: Eastern Economists, Monthly Statistics of 
Trade, Financial Express, Economic Times are some of the Journals and 
Newspapers which regularly collect and publish data on various aspects of 


business, economics, commerce and trade. 
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(b) Research Publications: A number of research organizations University 
departments and institutes like Indian Statistical Institute (I.S.I.) Calcutta 
and Delhi, I.C.A.R., N.C.E.R.T., I.C.M.R., etc. also contribute significantly 
to the availability of secondary data. 


(c) Publications of Business and Financial Institutions: A number of business 
and financial institutions like Chamber of Commerce and Trade Association, 
Institute of Chartered Accountants, Sugar Mill Association, Stock Exchanges, 
Trade Unions and Co-operative societies etc. also contribute significantly for 
the availability of secondary data in the related areas. 


(d) Articles: Market reviews and reports also provide data for analysis. 


2. Unpublished Sources: In some cases data are collected but these are not put in 
published form. For example research scholars in the institutes and Universities, 
trade associations and labour bureaus do collect data but they never put it in 
published form. Still, the data from these sources may be used when needed. 


1.11 Limitations of Secondary Data 
[B.C.A. (Meerut) 2009] 


1. There is a possibility that proper procedure might not have been followed in their 
collection. 


These may not be relevant in the present contest. 
These may not be free from personal bias and prejudices. 


These may not have the needed accuracy or reliability. 


gi ae a 


These may not be adequate. 


1.12 Precautions in Accepting the Secondary Data 
[B.C.A. (Agra) 2006] 


T The authority which had collected the data should be reliable and dependable, 
generally, a government or a semi government agency is more dependable than a 
private one. 


2. The method of collection, including the sampling technique, should be acceptable 
to the nature of the enquiry in hand. 


3: The primary and the secondary purposes should have some uniformity with 
respect to the aim and objects of the study. 


4. The interval between the periods when a data was originally collected to the time 
when it is going to be used should not be much. 


5. The facts on which the original data was collected should remain relevant to the 
present problem too. 


Cis ) Elements of Statistics 


1.13 Advantages of Secondary Data 
I: Economy: The time, money and labour is saved in collection of secondary data 


2. Reliable: These are more reliable because these are primarily collected by able and 
efficient researchers. 


3: Flexibility: They can be change according to need. 


4. Accuracy: A high level of accuracy is found in these data because these are 
collected by efficient persons. 


5. Useful for quick and generally for quick and regular informations. 


à Solved Examples "1 


Example 1: What are the advantages as well as the drawbacks of the study of business problems. 
[B.C.A. (Rohilkhand) 2008; B.C.A. (Agra) 2008, 2005; 
B.C.A. (Meerut) 2007, 2006; B.C.A. (Kanpur) 2004, 2002] 


Solution: In the mid of 20% 


history of modern time is finally written, we shall read it as beginning with the age of 


, a top American business executive wrote that “When the 


stream and then progressing through the age of electricity to that of statistics”. 


The use of statistical methods in analysis of sales and of accounting records is an old 
through. In business today, an increasing amount of time and effort is devoted to fact 
finding operations which costs less of money as well. This is done scientifically by means 
of the sampling methods. The sampling methods are of particular importance in market 
research, quality control of products and in forecasting of demand costs. Sample surveys 
of consumer apptitude toward products and services are routine methods in market 
research. Manufactures use statistical quality control to reduce the variations in the 
quality of the finished product. The use of statistical methods helps the business 
management in the making proper discussions in complex situations. Production, sales, 
purchase and finance-all needs statistical methods. A business involves quantitative 
information will sufficient variation in them. So use of statistical methods is a must 
there. 


Drawbacks: The statistical methods being quantitative total for analysis cannot always 
be consider as ideal. They do not remain the substitutes to the matured common sense 
statistical methods are only approximates and thus, are not advised to be used where 
perfect accuracy is wanted. Additionally statistical conclusions are applicable only in the 


group of observations and not with individual cases. 
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Example 2: “Statistics are numerical statements of facts, but all facts numerically stated are not 
statistics” comment ? 


Solution: This is, rather, a genuine statement. Statistics does not end can not deal with 
an event or is isolated case. That the height of a particular individual is 5’, Milton was 
blind. Nepolean was a great warrior are examples of statements that do not come with in 
the preview of statistics. The wage earned by an individual at a time, taken as an 
unrelated single unit, is not a statistical datum while wages earned by him over a period 
of time, taken as a group of related wages, are the different values of the same 
characteristics and are obviously related and obtain by some repetitive operation. 
Observation manufactured product illustrates the former while recording of weights of 


some college students will involved later type of operations. 
Example 3: Elucidate the following statements. 
Solution: 


(i) | We know that statistics does not deal with a single object or an single event 
standing along, put it pertains to the characteristics of an aggregate of objects 
called population. The yields of some specific plot is say 10,000 kgs. It is not for 
statistical study unless, supplemented by some similar other informations like 


yields from other identical plots or of same plot recorded in the previous years etc. 


(ii) It is not doubt, the first and an indispensable step of a statistical survey that the 
data collected be reduced or condensed for its subsequents. But statistical 


operations can not be taken as finished at this stage. 


(iii) We know that statistics deals with quantitative or numerically expressed 
quantitative characteristics. The amount of these characteristics will naturally vary 
from one object to another thus a set of varying observation remains the subject 
matter of statistics. There is no other science to deal with such data that is why 


statistics is branded as a science for studying variations in the variable values. 


Example 4: The science of statistics is essentially a branch of applied mathematics and may be 
regarded as mathematics applied to observational data” Elucidate it. 


Solution: Statistics in mathematics applied observational data. Statistics deal 
with problems involving empirical or observed evidences about the population 
characteristic. These evidences, when expressed numerically, are called statistical 
operations, namely, its treatment, summarisation, analysis and of drawing inference are 
undertaken. All these steps from the subject matter of statistics and are mathematical in 
nature. Thus statistics remains the mathematics applied to the collection data. However, 


the collection of data in itself to is a statistical job. 
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The science of statistics is a branch of applied mathematics. 


The advanced statistical methods are having their roots in the theory of probability 
which in turn has its foundation in the theory of measure and integration while they are 
the subject matter of mathematics. That is why statistics is taken as branch of applied 
mathematics. Also the statistical techniques, which are so frequently used in solving the 
problems of various science of our daily life, are based on different mathematical models 


and assumptions and remain mathematical in character. 


Thus statistical problems embrances the practical application of mathematics involving 


its problems of high utility and of almost value in everyday science and society. 
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10. 


11. 


12. 


13. 


\ Problem Set ` 


What are the various method of collecting statistical data ? Which one is most 
reliable and why ? 


[B.C.A. (Meerut) 2008, 2006] 


Distinguish between primary and secondary statistical data and explain the 
precautions. 


[B.C.A. (Meerut) 2001] 


Classify the methods generally used in the collection of statistical data and state 
briefly their respective merits and demerits. 


[B.C.A. (Delhi) 2004] 
What are requisites of a good table ? 
[B.C.A. (Agra) 2003] 


Discuss various methods of collecting data. What do you mean by primary and 
secondary data ? 


[B.C.A. (Meerut) 2007, 2001] 


What do you mean by a questionnaire and a schedule? State the important points 
to be kept in mind while drafting a questionnaire. 


[B.C.A. (Meerut) 2008, 2007, 2006] 
Define and differentiate between questionnaire and schedule. 
[B.C.A. (Meerut) 2007, 2005] 


Examine critically the important methods of collection of primary data. Can any 
method be called the best under all circumstances ? 


What are the various methods of collecting statistical data. Which of these is most 
reliable and why. 


[B.C.A. (Meerut) 2007] 
Distinguish between primary and secondary data. 


[B.C.A. (I.G.N.O.U.) 2011; B.C.A. (Kanpur) 2008, 2007; 
B.C.A. (Meerut) 2007, 2006;] 


Which of the following statements belong to statistics ? Give reasons. 
(i) | Indians are not hard workers. 

(ii) Iam taller than you. 

(iii) Our birth rate is still a point of worry. 


Illustrate with suitable examples the use of statistical methods in commerce, 
business and industry. 


Write an essay on statistical fallancies. 
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14. Comment on the following statements: 
(i) Statistics is the science of averages. 
(ii) Statistics is the science of counting. 
(iii) We compute statistics from statistics by statistics. 


15. What are the advantages as well as the drawbacks of the study of business 
problems through statistics. 
16. Describe the importance of statistics in trade and commerce. 
[B.C.A. (Avadh) 2007] 
17. State the brief three important causes of distrust of statistics. 
[B.C.A. (Purvanchal) 2006] 


18. 90% of those who drink (wine) due before reaching the age of 75 years. Therefore, 
drinking is bad for longevity. Do you agree ? Analyse. 


[B.C.A. (Bundelkhand) 2008] 
19. State the four main functions of a statistician. 
[B.C.A. (Garhwal) 2011] 
20. Define statistics. Describe its functions and importance. 
[B.C.A. (A.M.U.) 2008, 2005; B.C.A. (Agra) 2008, 2003] 
21. Define statistics and discuss its scope and limitations. 
[B.C.A. (Kashi Vidhyapeeth) 2011, 2009; B.C.A. (Meerut) 2010] 


22. Explain the importance of statistics in trade and commerce. 
[B.C.A. (Agra) 2006] 


23. ‘Statistics are like clay of which you can made a god or devil as your please’ 
comment. 


24. ‘Statistics affects everybody and touches like at many point’ comment. 
[B.C.A. (Purvanchal) 2009; B.C.A. (Rohilkhand) 2009, 2003] 


25. “Statistics is the science of estimates and probabilities’. Give limitations of 


statistics comment on this statement. 
[B.C.A. (Meerut) 2009] 


26. Define statistics and discuss its nature and scope.Describe how it is used in social 
and physical science. 


[B.C.A. (Meerut) 2009; B.C.A. (Avadh) 2009, 2006; 
B.C.A. (Agra) 2008, 2005] 


uun 


, Chapter 


Classification and 
Tabulation of Data 


2.1 Classification 


According to Conner-“Classification is the process of arranging things (either actually or 
nationally) in the groups according to their resemblances and affinities and give expression to the 
unity of attributes that may subsist amongst a diversity of individuals”. Thus, classification is 
grouping of data according to their identity, similarity, or resemblances. For 
example, students in a class may be grouped in respect of sex, age, marital status etc. 


Similarly, letters in the post office are classified according to their destinations viz. Delhi, 
Jaipur, Agra, Kanpur, etc. 


2.1.1 Objects of Classification 
[B.C.A. (I.G.N.O.U.) 2010, 2004; B.C.A. (Kashi Vidhyapeeth) 2008] 


The following are the main objectives of classifying the data: 
(i) It condenses the mass of data in an easily assimilable form. 

(ii) It eliminates unnecessary details. 

(iii) It facilitates comparison and highlights the significant aspect of data. 
( 


iv) It enables one to get a mental picture of the information and helps in drawing 
inferences. 


(v) It helps in the statistical treatment of the information collected. 


(vi) It helps in comparative study. 


(vii) It helps in clarifying the points of similarity and dissimilarity. 
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2.1.2 Types of Classification 

[B.C.A. (I.G.N.O.U.) 2010, 2004; B.C.A. (Kashi Vidhyapeeth) 2008] 
Statistical data are classified in respect of their characteristics. Broadly, there are four 
basic types of classification namely: 
i) | Chronological or Temporal Classification 


( 
(ii) | Geographical or Spatial Classification 
(iii) Qualitative Classification 

(iv) Quantitative Classification 


(i) | Chronological or Temporal Classification: In chronological classification the 
collected data are arranged according to the order of time expressed in years, months, weeks etc. 
The data are generally classified in ascending order of time. For example, the data 
related with population, sales of a firm, imports and exports of a country are always 
subjected to chronological classification. Specifically, the estimates of birth rates 
in India during (1970-79) are: 


Year [1970 1971] 1972] 1973] 1974] 1975 | 1976] 1977 | 1978 |1979 


(ii) Geographical or Spatial Classification: In this type of classification the data are 


classified according to geographical region or place. For instance, the production of paddy 
in different states in India, production of wheat in different countries etc. For 
indicating immediate comparison, the observations are either classified in the 
alphabetical order of the reference places in the order of size of the observation. 
See the following example: 


(a) When names of the countries are in Alphabetical order: 


Yield of Wheat in (kg/acre) 1925 893 | 225 | 439 | s62 | 


(b) When observations are in descending order: 


Yield of Wheat in (kg/acre) L925 Tce 


(iti) Qualitative Classification [B.C.A. (Meerut) 2004, 2002] 


In this type of classification, data are classified on the basis of some attributes or quality like 
sex, literacy, religion, employment, etc. Such attributes cannot be measured along with a scale. 
For example, if the students in a class are to be classified in respect of one attribute, 
say sex, then we can classify them into two classes namely that of males and 
females. 
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(iv) 


Similarly, they can also be classified into ‘employed’ or ‘unemployed’ on the basis 
of another attribute ‘employment’. Thus, when the classification is done with 
respect to one attribute, which is dichotomous in nature, two classes are formed, 
one possessing the attribute and the other not possessing the attribute. This type of 
classification is called simple or dichotomous classification. A simple classification 
may be shown as under: 


Students 


Male Students Female Students 


On the other hand, if we classify students simultaneously with respect to two 
attributes, e.g. sex and employment. Then students are first classified with respect 
to ‘sex’ into ‘Males’ and ’Females’. Each of these classes may then be further 
classified into ‘employed’ and ‘unemployed’ on the basis of attribute 
‘employment’ and as such students are classified into four classes, namely: 


(i) Male employed, (ii) Male unemployed, (iii) Female employed, (iv) Female 
unemployed. Still, the classification may be further extended by considering other 
attributes like marital status etc. 


The classification, where two or more attributes are considered and several classes are formed 
is called a manifold classification. 


Quantitative Classification: Grouped Data or Frequency Distribution 


In Quantitative Classification, the collected data are grouped with reference to the 
characteristics which can be measured and numerically described such as height, weight, sales, 
imports, age, income, etc. The first step in the direction of putting observations in some 
ordered form is to arrange them in ascending or descending order of magnitude. The 
data are then said to be in an array. For example, consider the data consisting of marks 
secured by 50 students in a class as shown in Table 1: 


Table 1: Marks of 50 Students, Ungrouped Data 
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Now, the data in above Table 1 may be arranged in an array as shown in Table 2: 


Table 2: Data from Table 1 Arranged in an Array 


But still, it does not fully eliminate the confusion which may result from a large 
number of marks. Thus, to make out data more comprehensive, we group or 
classify identical observations into ordered class-intervals. Grouping our data into 
ordered classes and determining the number of observations in each of the classes, we form a 
Frequency Distribution. The tabular form of a frequency distribution gives us a 
Frequency Table. 


2.2 Construction of Frequency Distribution 
[B.C.A. (Agra) 2007; B.C.A. (Purvanchal) 2006] 


To illustrate the construction of a frequency table, let us again consider the data in Table 
(1) which represent the marks of 50 students in an examination. Here, as a first step, we 
decide about the number of classes into which data are to be grouped. This is done 
arbitrarily, although we are guided by the number of observations to be grouped. 
Ordinarily, the number of classes should be between 5 and 20. The number of classes 
would also depend on the number of observations; with larger number of observations 
one can have more classes. An advantage of a large number of classes is that it leads to 
better accuracy in the computation of statistical measures like mean etc., but if there are 
too many classes, it defeats the purpose of grouping. On the other hand, too few classes 
would be sacrificing too much information and would lead to more errors in calculations 
for mean etc. 


Another factor used for determining the number of classes is the size, width or range of 
the class usually called class-interval denoted by A4. Class-intervals should, to the extent 
possible, be of uniform size or width. They are easier to comprehend and make the 
calculations easier. However, there may be exceptions to this rule. The school children, as 
an example, may be grouped in age classes 6-13, 14-17 and 18-22 of width 8, 4 and 5 
respectively. These groups or classes correspond to certain levels of education, e.g. 
elementary, secondary and higher education levels. Further, the width of the class should 
be a whole number conveniently divisible by 2, 3, 5, 10 or 20. Now, after deciding about 
the number of class intervals, say 10, we calculate the range R (highest marks minus 
lowest marks) of marks to be grouped. From data in Table 1, the range of marks is 


R=88-21=67 
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Now apply the following formula to get the approximate number of classes in which 
observations are to be grouped: 


Number ddug ao mnb g (1) 
h Class -interval 10 


Thus, in the grouping of data in Table (1), one should expect to have 6.7 or 7 classes in 
whole number. Formula (1) can also be used to decide about the class-interval, if we 
know the range of observations and the number classes, as 


Class interval = h = a sz(2) 
No. of classes 


Having determined the class-interval, one must decide where to start the classes. Since 
for data in Table 1, the lowest score is 21, so we might begin with 20 as it is common to 
let the first class start with a number which is a multiple of class interval. Now there are 
three following ways of describing class limits for different classes. 


1. Exclusive Method 

2: Inclusive Method 

3. True, Actual Class Limits or Class Boundaries 

l. Exclusive Method: In exclusive method of class formation, we add the class 
interval 10 to the lower limit of the lowest class to find the upper limit of the class 
as 20+ 10 =30. Thus, our lowest class becomes (20-30). The remaining class limits 
and classes are obtained by adding the class interval to each class limit until we 


reach the seventh class as (80-90), which contains the highest score of 88 in data in 
Table 1. The seven exclusive classes thus formed are shown in Table 3: 


Table 3: Different Types of Classes 


Exclusive Inclusive True or Actual Class Limits or 
Classes Classes Class Boundaries 
20—30 19.5—29.5 


= 2 
300 29.5395 
1050 595195 
= 5 


3090 195-89: 


Thus, in exclusive method of class formation, classes are so formed that the upper limit of 


9 
9 
9 
9 
9 
9 


one class is the lower limit of the next class and therefore, this method of classification ensures 
continuity between two successive classes which is essential for most of statistical calculations. 
However, one should note that in exclusive classes it is presumed that the score or 
observation equal to the upper limit of the class is exclusive, e.g. a score of 30 will be 
included in the class 30-40 and not in 20-30. 


_— 
C 28 ) Elements of Statistics 


Inclusive Method: Unlike exclusive classes, inclusive classes include scores or 


observations which are equal to upper limit of the class. In the formation of such classes 
we start with the lower limit 20 of the scores for the first class and then the lowest 
class is formed as (20-29) so as to include 10 marks (10 being the class interval). 
These 10 marks are 20, 21, 22, ... 28, 29. The remaining six classes are obtained by 
adding the class interval to each class limit of the previous class until we get the 
highest class as (80-89), inclusive classes so obtained are also given in Table 3. 


From the preceding discussion it should be clear that exclusive method should be used when 
data are of continuous nature or have been measured infractions of unit. Inclusive way of 
forming classes may be preferred when measurements on the variable are given in whole 


numbers. 


True or Actual Class or Limits or Class Boundaries: In inclusive method of 
class formation, it was observed that the upper class limit is not equal to lower 
class limit of the next class and so there is no continuity between classes. 
However, for most of statistical calculations it is desirable that classes be 
continuous. For overcoming this difficulty we assume that an observation does 
not just represent a point on a continuous scale, but an interval of unit length of 
which the given observation is the middle point. For example, a score of 20 in an 
examination represent the interval 19.5 to 20.5 on a continum. Similarly, a score 
of 29, then, is representable by the interval 28.5 to 29.5. Thus, the mathematical 
meaning of an observation is an interval which extends from 0.5 unit below to 0.5 unit 
above the face value of an observation on a continum. These class limits of an observation 
are termed as true or actual class limits. Thus, the true class limits for the class 
20-29 becomes 19.5-—29.5. The true limits for other inclusive classes are also 
shown in Table 3. 


After forming the classes, finally we count the number of observations falling in 
each class and record the appropriate number in frequently (denoted by f) 
column. The number of observations falling in each class is termed as class frequency. To 
facilitate counting these frequencies, we also prepare a tally chart with ‘tally bar’ 
column as shown in Table 4: 


Table 4: Data from Table 1 Group in Frequency Table (Exclusive Method) 


Marks Classes Tally Bar Frequency (f) 
30—40 Car ooo 
40—50 T a | 7 
50—60 


60-70 CL ae 
70—80 
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In Table 4, four tally bars are marked for four marks 21, 25, 25 and 29 which lie 
within the class (20-30), similarly six bars are marked for six scores lying in the 
class (30-40) and so on. In this way, the raw or ungrouped data in Table | are 
arranged in the form of a frequency distribution showing classes and the 
corresponding class frequencies. The total of all the frequencies is denoted by N. 
The frequency table in Table 4 gives the tabular form of this frequency 
distribution. Similarly, the data in Table 1 can be grouped by using inclusive and 
true class limits. Such a frequency table is given in Table 5: 


Table 5: Raw Data from Table 1 Grouped into Frequency Table 
(Inclusive Method) 


Coe [sses Pas fe 


80—89 79.5—89.5 


Total 


In the process of forming a frequency distribution, we observe that individual observations lose 
their identity when grouped into classes. We need, therefore, a point to make some assumption 
about the location of all the observations included in a class. On the assumption that 
grouped observations are more or less uniformly distributed between class limits, a 
point mid-way between class limits can be chosen as representative of all the 
observations in that class. This point is called the mid-point of the class and can be 
obtained by averaging the lower and upper class limits as follows: 


lower class limit + upper class limit 


Mid-point of the class = 7 


Thus, the mid-point of the class (20-29) will be (20+29)/2=24.5. The mid-points 
of the remaining classes are shown in Table 5. 


Now, we are in a position to list the steps in grouping a large set of data into a 
frequency distribution. 
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2.2.1 Steps in the Construction of Frequency Distribution 
(i) Find the smallest and largest observations in the raw data and find the range R. 

(ii) Decide on the number of classes. 

(iii) Divide range by the number of classes to estimate approximate size of the interval. 


(iv) Find the lower class limit of the lowest class and add to it the class interval to get the upper 
class limit. 

(v) Obtain class-limits for the remaining classes by adding the class interval to the limits of the 
previous class. 


(vi) Count number of frequencies in each class and check against the total number of observations. 


2.3 Cumulative Frequency Distribution 

[B.C.A. (I.G.N.O.U.) 2009; B.C.A. (Kanpur) 2008, 2006] 
Sometimes we may be interested to know the number of observations less than or more 
than a specified value. The principal, for example, may be interested in knowing the 
number of students scoring above 60 or below 30 marks. Such results can be easily 
obtained from a cumulative frequency distribution which can be derived from a simple 
frequency distribution by merging successive class-intervals until all have been 
cumulated. Such a cumulation procedure may be of two types resulting in two types of 
cumulative frequencies, namely: 


2.3.1 ‘Less than’ Cumulative Frequency Distribution 


By adding the frequencies of all observations less than ‘true’ upper limit of a class, we get ‘less than’ 
cumulative frequency. Thus, for calculating these frequencies ‘true’ upper limit of a class 
is regarded as the reference point. ‘Less than’ cumulative frequency table for the grouped 
data of Table 5 is shown in Table 6: 


Table 6: Data from Table 5 Grouped into ‘Less than’ and ‘More than’ 
Cumulative Frequency Table 


Classes Frequency Classes Cumulative Frequency 


ma 
a 
Leathan |446+8+12+9+7+4=50 | More than 
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2.3.2 ‘More than’ Cumulative Frequency Distribution 


For ‘more than’ cumulative frequency true’ lower limit of the class is taken as the reference point. By 
adding the frequencies corresponding to more than ‘true lower limit’ of a class, we get 
‘more than’ cumulative frequency. ‘More than’ type cumulative frequency table 
corresponding to data in Table 5 is also given in Table 6. 


2.4 Relative Frequency Distribution 


Sometimes it is desirable to express the frequencies in a class in the form of proportion or 
percentage of the total frequency. This may be useful for comparing the distributions of 
frequencies for two sets of data. The relative frequency of a class is obtained by dividing the 
class frequency by the total frequency. A table showing relative frequencies in each class is accordingly 
called as relative frequency table. If we multiply each relative frequency by 100, we get a 
percentage distribution of the frequencies. The relative and percentage frequency table 
for the data in Table 5 is given in Table 7: 


Table 7: Relative and Percentage Frequency Table 


.14 x100 =14 
.08 x100 =8 
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Solved Examples 


Example 1: The marks of 80 students in mathematics are given below. Using inclusive classes, 


prepare a frequency table by taking the class width as 5. 


Solution: As a first step in preparing frequency table, we decide the range of marks as 
R = the highest marks — the lowest marks = 97 —53 = 44. 


Since the lowest mark is 53, we start with the lower limit 50 of the first class so that, with 
a class width of 5, the lowest class is (50 —54) as it includes 5 marks (50, 51, 52, 53 and 
54). The remaining classes can be obtained by adding the class width to each class limit 
of the previous class until we get the highest class (95 —99) which includes the highest 
marks 97. Finally, using tally bar, the class frequencies can be recorded as shown in the 
frequency table. 


Frequency Table 8: Showing Distribution of Marks of 80 
Students in Mathematics 
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Example 2: Mid-points in a frequency distribution of age of a group of persons are 18, 25, 32, 
39, 46, 53, 60 and 67. Find 


(i) the size of the class 
(ii) the class limits, assuming that the age quoted is the age completed last birthday 


(iii) the class boundaries [B.C.A. (I.G.N.O.U.) 2009, 2006] 


Solution: 


Table 9: Showing Class-Limits and Class Boundaries 
for the given Mid-Point 


50—56 49.5—56.5 
57—63 56.5—63.5 
64—70 63.5—70.5 


(i) Since given mid-points have a uniform difference, so class intervals too will be of 


uniform size. The size of the class is the difference between the mid-values of any 
two consecutive classes, which is 32 —25 =7 in the present case. 


(ii) In case the age quoted is taken as the age completed on last birthday, the variable 
‘age’ assumes only integral values. Thus the ‘inclusive classes’ of width 7 with 
above mentioned mid points are to be defined. Therefore, the class-limits for an 
inclusive class of size 7 having a mid-point 18 will be (18 — 3,18 + 3)ie. (15 — 21). In 
the same way, the class-limits for the remaining mid-points can be obtained as 
shown in the table. 


(iii) The class boundaries for different classes can be obtained by adding and subtracting 
half the size of the class from the corresponding mid-values. Here half of the size of 
class is 7/2=3.5. Thus, for a class with mid-value 18, the class boundaries will be 
(18 —3.5,18+3.5), ie. (14.5 —21.5). Following the same way, the class boundaries 
for the remaining classes can be obtained as shown in the table. 
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Example 3: Prepare ‘Less than’ and ‘More than’ cumulative frequency distributions from the 


following distribution. Also obtain relative frequency table: 


Wages (®) 240-250 | 250-260 | 260-270 | 270-280 | 280-290 | 290-300 


Solution: ‘Less than’ and ‘More than’ cumulative frequency distributions for the above 
simple frequency distribution are shown in the following table: 


‘Less than’ and ‘More than’ Cumulative Frequency Distributions 


‘Less than’ Cumulative Frequency ‘More tha 


n’ Cumulative Frequency 
Distribution Distribution 
More than 
More than 
= 100-5 =95 
More than 
= 100 -5-10 =85 
5+10+18+27=60 Meore them ings 10 e607 
270 
5+10+18+27+35=95 G a OO- 0 E-270 
Less than | 5410+18+27+35+5 =100 | More than | 100-5-10-18-27-35=5 
300 290 


We know that the relative frequency for a class is obtained by dividing the class 


frequency by the total frequency. The relative frequency distribution so obtained is given 
in the table below: 


The Relative Frequency Distribution of 100 Workers 
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obtain a discrete frequency table. [B.C.A. (Avadh) 2006] 


Solution: Here, the variable (X), the number of children, assumes the values 0, 1, 2, 3, 
4 and 5. Counting frequencies for each value we get the needed discrete frequency table. 


Discrete Frequency Distribution of No. of Children in 50 Families 


x oh 
No. of Children No. of Families 


Example 5: Ina survey, it was found that 50 families bought milk in the following quantities in a 
particular month. Classify the following data in an individual discrete, continuous and cumulative 
frequency series: 


[B.C.A. (Rohilkhand) 2011, 2005. 2003; B.C.A. (Bundelkhand) 2010, 2006, 2004; 
B.C.A. (Rohtak) 2008, 2004] 
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Solution: 


(i) Individual Series: In the ascending order of items 


5 
5 
6 
7 
7 
8 
9 
9 


(ii) | Discrete Series: In the ascending order of the items 


Quantity of | Tally Bar | Frequency | Quantity of | Tally Bar | Frequency 
Milk Milk 


a es 
ie 
nS] 
[a] 
i 
[ti] 


p 


— | = 
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-- 
N| N 
$ 
PF 


N 


(iii) Frequency Distribution: 
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(iv) Cumulative Frequency Distribution: 


a a a 
oo [8 | tes ton | 
18 Sy) 


er ee 
[tana | a0 | tewten so | 0 


Example 6: Prepare a frequency series of the ages of 25 students of class B.B.A. in your school. 
15, 16, 16,17, 18,17, 15, 15, 16, 16,17, 15,16, 16, 15, 16, 16,17, 17, 18,19, 16,15 


Solution: In this question, minimum age is 15 years and the maximum age is 19 years. 
Therefore, the distribution of frequencies would correspond to the values between 15 
and 19. 


Frequencies Series 


III 


Example 7: We have the following data on the monthly expenditure of food (in X) for 30 
households in a locality. 
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(i) Obtain a frequency distribution using following class interval 100-150, 150-200, 
200-250, 250-300, 300-350. 


(ii) What percentage of households spends less that % 250 per month and what 
percentage of households spends more than ~ 200 per month ? 


[B.C.A. (Kurukshetra) 2012] 


Solution: 


(i) | The data may be presented in the form of an exclusive series; 


owa | a oe 


200-250 


il 


Total Lf =30 


(ii) No. of households spending less than ¥ 250 per month =4 + 6 +13 =23 


Percentage of households spending less than % 250 per month = < x100 =76.6% 


No. of households spending more than Ẹ 200 =13 +5 +2 =20 


Percentage of households spending more than Ẹ 200 = = x 100 =66.5% 


2.5 Three Forms of Data 


Going through the process of Quantitative Classification we observe that statistical data 
can be put in the following three forms: 


I; Raw data, ungrouped data or individual series. 
2. Discrete frequency distribution. 


3. Continuous frequency distribution. 
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Problem Set-l 


Define classification and explain various ways of classification. 


Discuss the importance of classification in statistics. 
[B.C.A. (Meerut) 2008, 2007] 


Explain the general principles of classification of data for forming a frequency 
distribution. 
What is meant by ‘Classification’ ? State its objectives. Briefly explain different 


methods of classifying statistical data. [B.C.A. (Agra) 2006] 


Explain the following terms: 
i)  Class-interval 
ii) Class-frequency 


iii) Class-limits 


( 
( 
( 
Gv) Class mark or class mid-point 
(v) Frequency distribution 

(vi) Cumulative frequency table 

(vii) Relative frequency table 

Discuss various steps in the construction of a frequency distribution from raw 


data. [B.C.A. (Meerut) 2007, 2005] 


Distinguish between: 
(i) Simple and manifold classification 
(ii) Inclusive and exclusive methods of class formation 


(iii) “Less than’ and ‘More than’ cumulative frequency distributions. 


Prepare a discrete series from the following data: 


If class mid-points in a frequency distribution of a group of persons are 25, 32, 39, 
46, 53, 60, 67, 74 and 81, find 


(i) Size of class interval, and (ii) The class boundaries. 
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10. 


I1. 


12. 


13. 


10. 


Classify the following by taking class-intervals such that their mid-values are 17, 
22,27, 32 and so on: 


What are the objects of classification of data ? Discuss the different methods of 


classification. 


Distinguish between classification and tabulation. Discuss the purpose, methods 
and importance of classification. 


Find the frequency table having grades of wages with class interval of Rs. 20 each 
from the following data of daily wages received by 30 workers in a certain factory 


daily wages in rupees are : 


140, 160, 140, 220, 130, 150, 240, 120, 230, 140, 200, 170, 210, 220, 180, 180, 
190, 200, 270, 160, 150, 110, 120, 210, 200, 170, 180, 190, 230 


Answers-] 


EPP PE EEE ESSE esp pan 
f rt f2fats[sf2{3fafe]s]zfafefs|s[s| | 
) F 


(i 


(ii) 21.5-28.5,  28.5-35.5, 35.5-42.5, 42.5-49.5, 49.5-56.5, 56.5-63.5, 
63.5-70.5, 70.5-77.5 77.5-84.5 


Classification and Tabulation of Data Ca) 


2.6 Tabular Presentation 


2.6.1 Table and Tabulation [B.C.A. (Meerut) 2009] 


Tabulation is the process of summarizing classified or grouped data in the form of a table so that it 
is easily understood and an investigator is quickly able to locate the desired information. A table is a 
systematic arrangement of classified data in columns and rows. Thus, a statistical table makes it 
possible for the investigator to present a huge mass of data in a detailed and orderly form. 
It facilitates comparison and often reveals certain patterns in data which are otherwise 
not obvious. Classification and Tabulation as a matter of fact, are not two distinct 
processes. Actually they go together. Before tabulation, data are classified and then 
displayed under different columns and rows of a table. 


2.6.2 Advantages of Tabulation 

Statistical data arranged in a tabular form serve following objectives: 

(i) It simplifies complex data and the data presented are easily understood. 
(ii) It facilitates comparison of related facts. 


(iii) It facilitates computation of various statistical measures like averages, dispersion 
correlation etc. 


(iv) It presents facts in minimum possible space and unnecessary repetitions and 
explanations are avoided. Moreover, the needed information can be easily located. 


(v)  Tabulated data are good for references and they make it easier to present the 
information in the form of graphs and diagrams. 


2.6.3 Preparing a Table [B.C.A. (Meerut) 2009] 


The making of a compact table is itself an art. This should contain all the information 
needed within the smallest possible space. What the purpose of tabulation is and how the 
tabulated information is to be used, are the main points to be kept in mind while 
preparing for a statistical table. An ideal table should consist of the following main parts : 


(i) | Table Number: A table should be numbered for easy reference and identification. 
This number, if possible, should be written in the centre at the top of the table. 
Sometimes it is also written just before the title of the table. 


(ii) Title: A good table should have a clearly worded, brief but unambiguous title 
explaining the nature of data contained in the table. It should also state arrangement 
of data and the period covered. The title should be placed centrally on the top of a 
table just below the table number (or just after table number in the same line). 


(iii) Captions or Column Headings: Captions in a table stand for brief and 
self-explanatory headings of vertical columns. Captions may involve headings and 
sub-headings as well. The unit of data contained should also be given for each 
column. Usually, a relatively less important and shorter classification should be 
tabulated in the columns. 
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(iv) 


(v) 


(vi) 


(vii) 


Stubs or Row Designations: Stubs stands for brief and self-explanatory headings 
of horizontal rows. Normally, a relatively more important classification is given in 
rows. Also a variable with a large number of classes is usually represented in rows. 
For example, rows may stand for score classes and columns for data related to sex of 
students. In the process, there will be many rows for score classes but only two 
columns for male and female students. 


Body: The body of the table contains the numerical information of frequency of 
observations in the different cells. This arrangement of data is according to the 
description of captions and stubs. 


Footnotes: Footnotes are given at the foot of the table for explanation of any fact 
or information included in the table which needs some explanation. Thus, they are 
meant for explaining or providing further details about the data, that have not 
been covered in title, captions and stubs. 


Sources of Data: Lastly one should also mention the source of information from 
which data are taken. This may preferably include the name of the author, volume, 
page and the year of publication. This should also state whether the data contained 
in the table is of primary! or secondary” nature. 


A model structure of a table is given below: 


Model Structure of a Table 


«—— Table Number ________ p 


«— Title of The Table —————_» 


Stub Heading Caption Headings 
LE Caption Sub-Headings 


— Stub Sub-Heading — 


a 


Foot Note : il. 2: 


Source Note i I; 2; 
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2.6.4 Requirements of a Good Table 


A good statistical table is not merely a careless grouping of columns and rows but should 


be such that it summarizes the total information in an easily accessible form in minimum 


possible space. Thus, while preparing a table, one must have a clear idea of the 


information to be presented, the facts to be compared and the point to be stressed. 


Though, there is no hard and fast rule for forming a table yet a few general points should 


be kept in mind. 


(i) 
(ii) 
(iii) 


(iv) 


(vii) 


(viii) 


(ix) 


A table should be formed in keeping with the objects of a statistical enquiry. 
A table should be carefully prepared so that it is easily understandable. 


A table should be formed so as to suit the size of the paper. But such an adjustment 
should not be at the cost of legibility. 


If the figures in the table are large, they should be suitably rounded or 
approximated. The method of approximation and units of measurements too 
should be specified. 


Rows and columns in a table should be numbered and certain figures to be stressed 
may be put in ‘box’ or ‘circle’ or in bold letters. 


The arrangement of rows and columns should be in a logical and systematic order. 
This arrangement may be an alphabetical; chronological or according to size. 


The rows and columns are separated by single, double or thick lines to represent 
various classes and sub-classes used. The corresponding proportions or percentages 
should be given in adjoining rows and columns to enable comparison. A vertical 
expansion of the table is generally more convenient than the horizontal one. 


The averages or totals of different rows should be given at the right of the table and 
that of columns at the bottom of the table. Totals for every sub-class too should be 
mentioned. 


In case it is not possible to accommodate all the information in a single table, it is 
better to have two or more related tables. 


2.6.5 Type of Tables 


Tables can be classified according to their purpose, stage of enquiry, nature of data or 


number of characteristics used. On the basis of the number of characteristics, tables may 


be classified as follows: 


(i) 


Simple or One-Way Table: A simple or one-way table is the simplest table which 
contains data of one characteristic only. A simple table is easy to construct and simple 
to follow. For example, the adjacent blank table may be used to show the number 
of adults in different occupations in a locality. 
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(ii) 


(iii) 


The Number of Adults in Different Occupations in a Locality 


oe 


Two-Way Table: A table which contains data on two characteristics is called a two-way 
table. In such case, therefore, either stub or caption is divided into two co-ordinate 
parts. In the given table, as an example, the caption may be further divided in 
respect of sex. This sub-division is shown in the adjacent two-way table which now 
contains two characteristics namely, occupation and sex. 


The Number of Adults in a Locality in respect of Occupation and Sex 


| n “ | 


Total 


Manifold Table: Thus, more and more complex tables can be formed by including 
other characteristics. For example, we may further classify the caption sub-headings 
in the above table in respect of ‘marital status’, ‘religion’ and ‘socio-economic status’ 
etc. A table in which more than two characteristics of data are considered, is called a manifold 
table. For instance, table below shows three characteristics, namely, occupation, sex 
and marital status. 


The Number of Adults in a Locality in respect of Occupation, 


Sex and Marital Status 


Occupations No. of Adults Total 


Foot note — ‘M’ stands for married and ‘U’ for unmarried. 


Manifold tables/though complex, are good in practice as these enable full 
information to be incorporated and facilitate an analysis of all related facts. Still, as 
anormal practice, not more than four characteristics should be represented in one 
table to avoid confusion. Other related tables may be formed to show the 
remaining characteristics. 
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Example 8: Prepare a blank table to show the distribution of students in a college according to 
marks, sex and two sections. 


Solution: Here three characteristics are to be simultaneously represented in a table and 
so either a row or a column heading may be sub-divided to show the third characteristics. 
As a rule, sub-division of a column heading is advisable. The needed blank table thus 
takes the form. 


Distribution of Students according to Marks, Sex and Section 


Marks 
Classes 


Example 9: In a sample study about coffee drinking habits in two towns the following information 
was recorded: 


Town A ‘Females were 40%, total coffee drinkers were 45% and male non-coffee drinkers were 
20%. Town B: Male were 55%, male non-coffee drinkers were 30% and female coffee drinkers were 
15%. Represent the data in a tabular form. 


Solution: In such questions, we first form a blank table showing the needed 
classification. Then we fill up the cells with given frequencies which have been shown by 
bold letters in the following table. The remaining frequencies can be easily obtained by 
adjusting the marginal totals. For example, in town A, the male percentage can be 
obtained by subtracting female percentage from 100, i.e. 100 -40 =60. Similarly, the 
percentage of male coffee drinkers can be obtained by subtracting the percentage of male 
non-coffee drinkers from 60, i.e. 60 — 20 = 40. The remaining frequencies in other cells are also 
obtained in the same. 
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Way-Table Showing the Coffee Drinking Habits of Town A and B 
(Figures in Percentage) 


Drinking 


Habit 


Coffee drinkers 
Non-Coffee drinkers 


Example 10: Prepare a blank table showing the distribution of students of a college according to 
age, sex, class and residence for arranging physical training. 


Solution: Table showing the distribution of students of a college according to age, sex, 
class and residence: 


Urban Rural Urban 


ef 
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2.7 Difference between Classification and Tabulation 
[B.C.A. (A.M.U.) 2011] 


The main objective of both classification and tabulation is to present the collected data 


in a summarised and compact form but there are few differences which are as follows: 


l. Difference in Sequence: From the view of sequence in statistical inquiry, 
classification is done first and tabulation later i.e., classification forms the basis for 


tabulation. 


2. Difference in Presentation: In classification, data are classified into different 
classes according to their similarities and dissimilarities, while in tabulation the 
classified data are placed in columns and rows. Thus, tabulation is an mechanical 


function of classification. 


3. Difference in Method: Classification is a method of statistical analysis, while 


tabulation is a method of presentation of data. 


4. Basis of Presentation: In classification, data are classified into classes and 
sub-classes while in tabulation data are presented under headings and 


sub-headings. 


5. Use of Data: In classification only original data are used, while in tabulation 


derivatives like percentages. 


— 
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h Problem Set-2 1 


Define table and tabulation. [B.C.A. (Meerut) 2008, 2006] 
Discuss the importance of tabulation. [B.C.A. (Rohilkhand) 2009, 2008, 2006] 
State clearly the essentials of a good table. 

Write a note on various parts of an ideal table. [B.C.A. (Agra) 2007, 2005] 
Define ‘Caption/and Stubs’. 

Define a two-way table with example. 

Define a manifold table with example. 


Distinguish between ‘Classification and Tabulation’. 
[B.C.A. (Meerut) 2008, 2007, 2006] 


Prepare a blank table to show the number of employees in a firm according to: 
(i) Sex: male and female 
(ii) Age-Groups: below 25, 25-35, 35-45 and above 45. 


Construct a blank table showing the distribution of workers in a factory according 
to: 

(i) Religion: Hindus, Muslims, others. 

(ii) Age Groups: 0-30, 30-50, 50 and above. 
(iti) Skill: Skilled and Unskilled. 

(iv) Sex: Male and Female. 


Write a note on importance of classification and tabulation. 
[B.C.A. (Meerut) 2008] 


What is statistical table ? Mention of rule of construction of a table. 
[B.C.A. (Avadh) 2009, 2007] 


OOO 


Chapter 


Frequency Distribution and 
Graphical Representation 


3.1 Display Data 
There are numerous ways of displaying statistical data pictorially. Some of the main forms of 
such visual presentation are: 
1. Diagrammatic Presentation 
2 Graphical Presentation 
3. Pictures 
4 Maps 


3.1.1 Diagrammatic Presentation 

In a diagrammatical presentation of statistical data, we make use of geometric figures 
like bars, squares, rectangles, circles etc. Data classified according to some chronological 
order, geographical region or attributes is generally presented by diagrams and pictures. 
Maps are generally used for geographical presentation of statistical data. Graphical 
presentation, a subject matter of the next chapter, is used for the pictorial representation of a 
frequency distribution or a cumulative frequency distribution. 


3.1.1.1 Importance of Diagrams and Graphs 
i) They are attractive and impressive. 
ii) They are even appealing to a common man. 


( 

( 

(iii) They facilitate comparison. 

(iv) They have universal acceptability. 
( 


v) They are useful for determining some statistical measures of data like median, 
mode, quartiles and other partition values. 
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3.1.1.2 Rules for Constructing Diagrams 


There are following rules for constructing diagrams: 
(i) Every diagram should be suitably and briefly titled. 

(ii) It should suit to the size of the paper. 

(iii) It should be neat and attractive. 

(iv) It should be clearly indexed. 

(v) Diagrams should contain foot notes and a proper spacing between the figures. 
(vi) The details in the diagram should be self explanatory. 


3.1.1.3 Type of Diagrams 
(A) One-Dimensional Diagrams 


One-dimensional diagrams also called Bar-diagrams, are most widely used 
diagrams for the visual presentation. Bar-diagrams of are the following types: 


i) Simple Bar-Diagrams 

ii) Multiple Bar-Diagrams 

iii) Sub-Divided Bar-Diagrams 
iv) Percentage Bar-Diagrams 
v) Deviation Bar-Diagrams 
vi) Broken Bars 


(i) | Simple Bar-Diagrams: A bar is a thick whose thickness is taken only for making 
the diagram attractive, Bar-diagrams are called one-dimensional as only the length 
of the bar represent the magnitude of the value of the variable. Thus, a bar diagram 
consists of a number of rectangles and is used only for one-dimensional 
comparisons. 


Solved Examples 


Example 1: Draw the bar-diagram to represent the following data related to a school: 


1990 OQI 1992 1993 1994 1995 


Present the data with a suitable diagram. [B.C.A. (Meerut) 2011] 


Solution: In this case data can be represented by a bar-diagram as the figures relate to 
one variable only for bar-diagram see Fig. 3.1. 
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Enrolment (1990-95) 
400 


Bee 


No. of Students 


Years 


Fig. 3.1: Single Base Diagram 


+ Limitations of Bar-Diagram 
A bar-diagram has the following limitations: 


(a) This diagram should be used only when the given figures do-not vary much 
and the proportion between the figures is small. 


(b) This diagram can be used to represent data on one variable only. 


(ii) Multiple Bar-Diagrams: A Multiple-bar diagram is used when a comparison is to be 
made between two or more variables. These are also used for comparing magnitudes of one 
variable in two or three aspects. The technique of drawing such diagrams is the same as 
that of simple bar-diagrams. The difference is that in the present case a group of 
bars representing is placed side by side. The bars are distinguished by shading or 
colours to show the representation of different variable. The following is an 


example of multiple-bar diagram. 


Example 2: Following data relate to the faculty-wise enrolment of students in a college: 


Represent the data by a suitable diagram. 


Solution: In this example, year-wise data on three aspects (Arts, Science and Commerce) 
over years is given. So, a multiple bar-diagram should be used for the purpose. See Fig 3.2. 
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No. of Students 


(iii) 


Faculty-wise Enrolment (1993-95) 


U Arts B Science Commerce 


Fig. 3.2: Multiple-Bar Diagram 


Sub-Divided Bar-Diagram: The sub-divided bar-diagram, also known as components 
bar-diagram, is useful in a situation when it is necessary to show and compare the breakup of one 
variable into several components. In this case, bars are drawn to represent the total or 
aggregate magnitudes of the variable; one bar to represent each time period or 
geographical area. Each bar is then sub-divided in several parts, each part representing 
a component of the aggregate or total. Different shades or colours are used to 
distinguish between the components. Various components of the variable are shown 
in the same order in different bars for easy comparison; the largest component is 
usually indicated on the lower portion of the bar and the smallest component on its 
uppermost portion. For example, the total number of students in a college in different 
years, classified according to sex, may be represented by a sub-divided bar-diagram. 
Following example will further clarify the purpose and the procedure of drawing a 


sub-divided bar-diagram. 


Example 3: Following data relate to_year-wise enrolment in a college, classified according to sex: 


1990-91 1991-92 1992-93 1993-94 1994-95 


No. of Boys 1215 1160 1325 1410 1480 
2025 1985 2169 2190 2300 


Represent the data by a suitable diagram. 
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Solution: The sub-divided bar-diagram to represent the given data is shown in Fig. 3.3. 


Enrolment in College (1990-95) © No. of Boys 
No. of Girls 


No. of Students 


1990-91 1991-92 1992-93 1993-94 1994-95 


Year 


Fig. 3.3: Sub-divided Bar Diagram 


(iv) Percentage Bar-Diagram: The construction of percentage bar-diagram is similar to the 
sub-divided bar chart. The difference between the two is that, in the sub-divided bar 
diagram, the component parts are shown in absolute quantities, while in the 
percentage bar diagram, the component parts are transformed into percentages of 
the total. The advantage of this diagram is that all the bars are drawn with equal 
heights representing a total of 100. These bars are then sub-divided in terms of 


percentages of the components. 


Example 4: Represent the data in Example 2 by a percentage bar-diagram. 
[B.C.A. (I.G.N.O.U.) 2006] 


Solution: For representing the data by a percentage bar-diagram, we obtain the 


percentage of students faculty-wise, for each year as shown in the Table 1. 


Table 1: Percentage of Students, Faculty-wise and for Each Year 


Percentage of Students Total % 
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The data in Table | is represented by a percentage bar-diagram in Fig. 3.4. 
Faculty-wise Percentage 


E Commerce Science Œ Arts 
100%- 


% Age of Students 
Y è D ow 
S Q$ S $ 
xs A A & 


=) 
& 


Fig. 3.4: Percentage Bar-Diagram 


(v) Deviation Bar-Diagrams: Deviation bar-diagrams are used to show the net magnitudes 
of a phenomenon, i.e. net profit, net loss, net exports or imports, etc. Bars in these diagrams 
can assume both negative and positive values. Positive valued bars are shown 
above the base line (x-axis) while negative valued bars are drawn below it. The 


concept has been clarified with the following example. 


Example 5: Depict the following data by a suitable diagram: 
(Balance of Trade = Export - Import) 


Export Balance of Trade (Millions 3) 


[B.C.A. (I.G.N.O.U.) 2006] 


Solution: Deviation bar-diagram representing data on trade balance is shown in Fig. 3.5. 
Balancing Trade (1994-96) 


© Deficit © Surplus 
20 
z 
2 
= 
= 
S 
tv 
1994 1995 1996 
Years 


Fig. 3.5: Deviation Bar-Diagram 


Frequency Distribution and Graphical Representation Css > 


(vi) Broken Bars: Sometimes we wish to represent series having wide variations in 


values, i.e. some values may be very small and others very large. In such a situation 
difficulty arises in choosing a suitable scale to accommodate smaller and larger values 
together. The difficulty is overcome by using broken bars for representing the larger values. 


Example 6: The following data relate to sales in five firms A, B, C, D and E: 


Fn 


Use a suitable bar-diagram to represent the data. 


Solution: Since there is a big gap between the smaller and the larger scale values, so 


broken bars may be used to represent the larger values as shown in Fig. 3.6: 


(B) 


(i) 


300 


Sales (= in Lakhs) 


Firms 


Fig. 3.6: Broken-Bar Diagram 


Two-Dimensional Diagrams or Area Diagrams: In one-dimensional diagrams 
we take only the length of the bars into consideration and width is taken for the 
beautification of the diagrams. However, in two-dimensional diagrams or area 
diagrams, the length as well as the breadth of the geometric figure is taken into consideration. 
Such diagrams are also useful in situations when the proportion between the 
magnitudes of given values of the variable is quite large. Some of the popular 
two-dimensional diagrams are: 


(i) Rectangle Diagrams 

(ii) Squares and Circle Diagrams 

(iii) Pie-diagrams 

Rectangle Diagrams: These diagrams are used for two-dimensional comparisons. 
While drawing the rectangles, we consider the length as well as the width of the 
bars. Thus, rectangles used for area diagram vary in heights as well as in the width, so that 


areas of the rectangles represent the magnitudes of the variable over time or space, or over some 
other characteristic of variation. 
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Rectangles are generally used for representing the total expenditure of two or more 
families made on various items. These are also used for making comparisons of 


expenses on various items of production and profit earned per-unit in a number of 
firms. The following example will clarify. 


Example 7: The following data represent the expenditure of the two families on various items: 


Represents the data by a rectangle diagram. 


S.No. Items Expenditure (®) 
Family A Family B 


Clothing 500 


Fuel and Electricity 


Solution: In this example, the width of the rectangles is kept in proportion to the total 
expenditure of the families, i.e. 3000 : 4500 or 2 : 3 and the height show the sub-divisions 
of the expenditures. See the Fig. 3.7. 


Expenditure of Family A & B Miscellaneous 
g Fuel & 
Electricity 
O O House Rent 
& 
Pi f Clothing 
> 
+ 
g Food 
[æ 
a. 
* 
jsa] 


Family B 


Fig. 3.7: Rectangle Diagram 
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Example 8: Represent the data in example 7 by a rectangle diagram on percentage basis. 


Solution: For representing the data by a rectangle diagram on percentage basis, we 


express the expenditure. 


On various items as percentage of the total expenditure for both the families as shown in 
Table 2: 


Table 2: Family Expenditures (in Percentages) 


ee | o foo | e | o 
[Psat oe fn | 00 [ane 


Again the widths of the rectangles are kept in proportion to the total expenditure, i.e. 2 : 3 
and expenditure is now shown in percentage and not in absolute terms. See the Fig. 3.8. 


Expenditure in % Ages © Miscellaneous 
ne I Fuel & 
se 90% Electricity 
P 80% E Hourse Rent 
Z 70% 
$ 
O, 
È 60% Clothing 
F ios La 
£ 40% LUA A, ERA 
x : S EASA H Foo 
mi 30% Bee SS 
pamm m . = 
10% ee ed 
WH: hi” #4 


Family A% Family B% 


Fig. 3.8: Rectangular Diagram 
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(ii) Squares and Circle Diagrams: Rectangle diagram is not very useful when the 
proportion between the magnitudes of the given values is quite large. For example, if 
the total expenditure of the two families in example 7 are 45,000 and 3000, then the 
width of the rectangles will be in the ratio 15 : 1 which is an inconvenient ratio to 
show diagrammatically. To overcome this situation, we use squares or circle 
diagrams. 


The procedure for drawing square or a circle diagram is quite simple. For drawing 
squares, sides of the squares are kept proportional to the magnitudes of the values 
and for circle diagrams, radii of the circles should be proportional. An example for 
such diagram will clarify. 


Example 9: The following data relate to the plan outlay of a country for three plans: 


I 
Outlay ( ‘000 Crores) 96 


Represent the data by square diagram. 


Solution: As there exist a very large proportion between the outlays of different plans, 
square diagram is suitable. For drawing a square diagram, we first calculate the square 


roots of the given values as shown in Table 3: 
Table 3 


Here the sides of the squares are in the ratio of the square roots of the outlay. The square 


diagram is shown in Fig. 3.9 next page. 


Scale 1 cms = 20 


N Uo pa 
Ue U NUUO BAU w 


— 


~ 


II Il Ill 
Fig. 3.9: Plans Square Diagram 
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Example 10: Represent the data in example 9 by a circle. 


[B.C.A. (Kurukshetra) 2011, 2005] 


Solution: In this case the radii of the circles will be kept proportional to the magnitude. 


The circle diagram is shown in Fig. 3.10: 


Plan I Plan II Plan II 


Fig. 3.10: Plans Circle Diagram 


(iii) Pie-Diagrams: Pie-diagrams are most popular diagrams used for representing 


breakdown of an aggregate into its components or sub-divisions. Thus, this 


diagram is generally used to compare the relations between various sub-divisions of 


a value. Pie-diagram is a circle divided into sectors with areas equal to the corresponding 


components. A pie-diagram shows the components or sub-divisions in terms of 


percentages only and not in absolute terms. The areas of the sectors are proportional 


to the angles subtended at the centre of the circle. For example, a pie-diagram may be 


used for representing India's tax revenue from various sources like excise and other 


taxes, income tax, corporate taxes etc. Steps in the construction of a Pie-diagram. 


(a) 


(b) 


(c) 


(d) 


(e) 


Plot a circle with its radius in proportion to the aggregate of the given values. 


Convert the component values in terms of degrees in the circle taking the 


aggregate of the values as 360°. 


For simplicity, it is desirable to arrange the component values in some logical 


pattern. For example, these may be arranged according to their sizes. 


Mark the points on the circle representing the size of each sector using a 


protractor. 


Represent each sector with different identification schemes. 
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The procedure will be clear from the following example: 


Example 11: The following data relate to faculty-wise enrolment in a college: 


Faculty 


No. of Students 2,010 1,100 2,390 5,500 


Represents the data by a pie-diagram. 


Solution: First we convert the given values into corresponding degrees in the circle 
taking the total number of students 5,500 as equal to 360°. The computation is shown in 
Table 4: 


Table 4: Calculation for a Pie-Diagram 


No. of Students Angles (in Degrees) 
Science 2010 2010 x 360 =131.56 
5500 


Arts 1100 MOO ee Pee 
5500 


Pie-diagram representing data is shown in Fig. 3.11. 


= f on 


Za 7? N ZA Science 
f f 
; y : ; oO Arts 


E Commerce 


Fig. 3.11: Pie-Diagram 


+ Important Remark 


A multiple pie-diagram is used for two-dimensional comparisons where a variable 
value is shown over time, space or in terms of some other characteristics and the 
variable values are also broken into components. The following example will 
clarify. 
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Example 12: Represent the data in example 7 by a multiple pie-diagram. 


Solution: In this case also we convert the expenditure on various items for both the 


families into corresponding degrees in the circle as shown in Table 5: 


Table 5: Computation for a Multiple Pie-Diagram 


Items of 


Expenditure 


2. Clothing 
4 


Fuel and 
Electricity 


Miscellaneous 


Family A 


Angle (in Degrees) 


i200 x 360 =144.0 


3000 


500 
3000 


x 360 =60.0 


oO 260 7210 


3000 


ze x 360 =30.0 
3000 


SW 6 54.0 


3000 


Family B 


Angle (in Degrees) 


Lano, x360 =136.0 


4500 


mo x360 =64.0 
4500 


2O A T 


4500 


ove x 360 =24.0 
4500 


S00 360 = 64.0 
4500 


A multiple pie-diagram is shown in Fig. 3.12. Here the radii of the two circles are in the 


ratio 3: 2. 
Expenditure of Families A and B. 
Family A expenditure (®) Family B Expenditure (7) 
Miscellaneous 
Fuel and 
Electricity f Food 
pone Fuel and 
Electricity 
House Rent 


x > House Rent ¥ 
Clothing 


Clothing 


Fig. 3.12: Pie-Diagram 


Ce) Elements of Statistics 


(C) 


(D) 


Pictorial Diagrams or Pictograms: Statistical data may be represented with the 
help of pictures also. Such a presentation is called pictorial diagram or pictogram. 
In pictograms, the magnitude of the values are explained with the help of pictures. 
The method is widely used by the government and private organisations. The 
method is specially useful when the trends in data are to be conveyed to the general 
public not having much of the mathematical background. In a pictogram, a 
symbolic picture represent a fixed quantity or magnitude of the variable. 
Obviously, the number of symbols represents the total magnitude of the values. 


Cartograms: Statistical data classified according to geographical regions are also 
representable with the help of suitable maps. The representation of statistical data by 
maps is called cartogram. For example, the data showing region-wise measurements of 
rainfall, region-wise density of population, etc. can be shown on a map of the region. 
Different type of shades or colours may be used for representing the magnitudes of 
the variable. 


Limitations of Diagrammatic Representation 
Diagrams are used to have a quick knowledge of the pattern of data. However, 


these cannot be considered as a substitute for classification or tabulation. When 
compared with the tabular form of data, diagrams have the following limitations: 


(i) | Data ona large number of variables cannot be presented in a diagram while a 


tabular form may include the same. 


(ii) Diagrams presents a vague comparison between various magnitudes of a 


phenomenon and it is generally not exact. 


(iii) Diagrams are useful for conveying the message or pattern of data to a layman 
while an expert can draw the necessary conclusions from the tabular form of 
data itself. 


3.1.2 Graphical Presentation 


There are many forms of graphs which can be broadly classified as: 


I 
2. 


Graphs of the Frequency Distribution 


Graphs of the Time Series or Line Graphs 


3.1.2.1 Graphs of the Frequency Distribution 


The graphs representing a frequency distribution are: 


Histogram 
Frequency Polygon 
Frequency Curve 


Cumulative Frequency Curve or ‘Ogive’. 
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The System of Co-ordinate Axes; General Principles 


Graphing is done with reference to two perpendicular lines called co-ordinate axes. 
The vertical line (_yy’) is called y axis and the horizontal (xx’ ) as x-axis. The point of 
their intersection is called the origin. Fig. 3.13 represents a system of co-ordinate 
axes. 


J 


II Quadrant I Quadrant 
(- ? ot ) ( a ? + ) 


l P (7,-3) 
R (-2, -7) i IV Quadrant 
a (+,-) 
IJ Quadrant 
(-,-) y' 


Fig. 3.13: The System of Coordinates Axes 


In Fig. 3.13 the origin, denoted by O, is the zero point or reference point for the 
two axes. All distances measured along the x-axis to the right of O are called 
positive and those to the left of O negative. Similarly, distances measured along 
the y-axis above the origin are positive, and below it negative. Four divisions 
resulting by the intersection of two axes are called quadrants. In the upper right 
division or first quadrant, both x and y measures are positive (+, +). In the upper 
left division or second quadrant, x is negative and y positive (-, +). In the lower left 
division or third quadrant, both x and y measures are negative (-, —); while in the 
lower right or fourth quadrant, x is positive and y negative (+,—). These four 
quadrants have been marked in figure The distance of a point from origin along x axis is 
called abscissa; and the distance of the point from origin on the y axis the ordinate. 


Histogram: In such graphs the class-intervals are plotted along x-axis and frequencies 
along y axis. The first class-interval is taken on x-axis and on it a rectangle with the 
frequency corresponding to that class interval as height is erected. Then the second 
class-interval is taken on x-axis and on it a rectangle with the frequency corresponding 
to this second class-interval as height is erected. In this way rectangles corresponding to 
each of the given class-intervals are erected side by side. 


The figure so formed is known as histogram. Here the area of rectangles are 
directly proportional to their corresponding class frequencies. 
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Note 1: In order to draw a histogram, it is necessary that the class-intervals are given by 
exclusive method. If these are given by inclusive method, then these should be first of all 


converted into exclusive method. 


Note 2: If the given class intervals are not equal, the frequencies are divided by the 
lengths of the corresponding class-intervals the heights of the columns or rectangles are 
taken proportional to these and ratios so that the areas of these rectangles are 


proportional to the respective frequencies. 


Example 13: Construct a histogram to represent the following data: 


Class-Interval 10—20 | 20—30 | 30—40 | 40—50 | 50—60 


Solution: 


Frequency 
80 


70 
60 
50 
40 
30 
20 


10 


0-10 10-20 20-30 30-40 40-50 50-60 
Score Classes 


Fig. 3.14: Histrogram 


Take the class intervals on x-axis and frequencies on y-axis. Since the frequency for the 
class-interval 0-10 is 25, so erect on the line depicting 0-10 on x axis as base a rectangle 
of height 25. Similarly erect on the line depicting the class interval 10-20 on x axis as 
base a rectangle of height 35 and so on. Thus we erect six rectangles corresponding to 


given six class intervals. The Fig. 3.14, so formed the required histogram. 


+ Histogram (Unequal Class-Interval) 


To take an example, let us arbitrarily group classes 30-39 and 40-49 into one class 
as 30-49 in Table 6. 


_—— 
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Table 6: Frequency Distribution 


20—29 19.5—29.5 
30—39 29.5—39.5 
40—49 39.5—49.5 


50—359 49. 5—59.5 
60—69 59.5—69.5 
70—79 69.5—79.5 


Similarly, classes 60-69 and 70-79 are grouped as 60-79. Grouped classes and 
corresponding frequencies thus obtained are shown in Table 7. 


Table 6, Data in Table 7, Grouped into Unequal Class-Intervals 


Classes Limits Interval | Frequency | Rectangles 
=50 


10 


80—89 79.5—89.5 EJ 
Pro fp 


In Table 7 the class-interval of second and fourth class is twice that of the rest of 


4/1=4 


the classes. Thus, the frequencies in second and fourth classes are not comparable 
with other classes. To establish this comparability, the frequencies in the larger 
classes should be halved or divided by two. Thus, before forming histogram for 
frequency distribution with unequal class-intervals, all larger classes should be 
expressed as multiples of smaller classes; and then divide the corresponding class 
frequencies by these multiples. This division then, gives the height of rectangles 
(as given in the last column of Table 7) to be formed over given classes. Now, if 
the class-interval 10 of the class 19.5—29.5 is taken as one unit then class interval 
20 of the class 29.5-49.5 will be equal to 2 units. Thus, a rectangle of height 7 


(ii) 


Elements of Statistics 


formed over 29.5-49.5 will represent an area 7 x 2 =14 which is, in accordance 
with the principle of histogram, equal to class frequency. Similarly, the height of 
the rectangle to be formed over class 59.5-79.5 will be 8 so that its area8 x 2 =16 
will represent the class frequency. However, heights of other rectangles formed 
over classes of unit lengths will remain equal to corresponding class frequencies. 
The histogram obtained for the frequency distribution of scores in Table 7 is 
given in Fig. 3.15. 


Class Frequency 


Height to the Rectangle 


9.5 19.5 29.5 49.5 59.5 79.5 89.5 


Score Classes 


Fig. 3.15: Histogram 


We have seen that histogram is a graphical representation of a frequency 
distribution. It is supposed to represent all the characteristics of a good table. 
Histogram too should be given a clear, brief and self-explanatory title. Index 
showing the scales of measurements on both axes should be given in the upper right 
corner of the figure and units of measurements should be mentioned along the axes 


so as to make the graph easily readable. 


Frequency Polygon: In these polygons, the value of the variate is assume to be 
concentrated at the mid-value of the interval. To construct this polygon, the 
mid-value of the class-intervals are taken as abscissae or x-co-ordinates and the 
corresponding frequencies are ordinates or y co-ordinates and these points are 
plotted on paper. These points are then joined in order by straight lines. Finally the 
first of these points is joined to the lower limit of the first interval and the last of 
these points to the upper limit of the last interval. The polygon so formed by these 
lines and x-axis is called frequency polygon. The defect of frequency polygon is 
due to fact that the value of the variate is assumed to be concentrated at the 


mid-value of the class-interval which is generally not true. 


"a 
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Example 14: Construct the frequency polygon to represent the following data: 


Class-Interval 0—10 10—20 | 20—30 30—40 40—50 50—60 


5 


[B.C.A. (Meerut) 2006] 


Solution: Find out the mid-values of the intervals i.e. 5, 15, 25, 35, 45, 55 and take 
them as x co-ordinates and the frequencies 25, 35, 70, 55, 30, 20 of the class interval 
(0-10), (10 —20)... (S50 —60) as _y-coordinates and thus we have the points (5, 25), (15, 
35), (25,70), (35, 55), (45, 30), (55, 20). Plot these points and join them in order by 
straight lines. Then join first of these points (5, 25) to (0, 0) as 0 is the lower limit of the 
first class-interval 0-10 and join last of these points (55, 20) to (60, 0) as 60 is the upper 
limit of last class interval 50-60. 


80 
70 


60 
50 
40 
30 


Frequency 


20 
10 


10 20 30 40 50 60 
Fig. 3.16: Frequency Polygon 


The polygon so obtained is the required frequency polygon. See the Fig. 3.16. 


Aliter: Here we mark the mid-points of the tops of each rectangle or column of the 
histogram and join them in order by straight lines. Then join first of the above marks 
with the point depicting the lower limit of the first interval and the last of the above 
marks with the point depicting the upper limit of the interval. The polygon so obtained 
in the required frequency polygon. 


(iii) Frequency Curve: If in a group distribution, the width of the class-intervals 
become very small then the number of observations increases and the frequency 
polygon tends to a smooth continuous curve which is called frequency curve. To 
draw the frequency curve, we plot the points as in the case of frequency polygon 
taking the mid-points of the class-intervals as abscissae and corresponding 
frequencies as ordinates. Then a free hand curves is drawn through these points 
which is known as frequency curve. See the Fig. 3.17 next page. 
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Fig. 3.17: Frequency Polygon 


The frequency curve corresponding to the data given example 14 is given in the 
adjoining Fig. 3.18. 
y 
80 
70 
60 
50 
40 
30 
20 


5 15 25 35 45 55 60 


Fig. 3.18: Frequency Curve 


(iv) Cumulative Frequency Curve or ‘Ogive’ 
[B.C.A. (Kanpur) 2008, 2005; B.C.A. (Meerut) 2004, 2002] 
A cumulative frequency curve or “Ogive’ is a graphical representation of a cumulative 


frequency distribution. The ‘less than’ or ‘more than’ cumulative frequency tables 
in Table 8 gives either the number. 


oad 
( 6 


Frequency Distribution and Graphical Representation A 69 k 


(a) 


Table 8 ‘Less than’ and ‘More than’ Cumulative Frequency 
Table for the Data in Table 6. 


Score Classes More than C.F. 
Less than 29.5 50 

Less than 39.5 4 

Less than 49.5 40 


Less than 69.5 More than 59.5 20 
Less than 79.5 More than 69.5 l 
Less than 89.5 More than 79.5 


of students scoring below the upper class limits or above the lower class limits. But 


Less than 59.5 More than 49.5 


these tables fail to provide the number of students scoring below or above the 
intermediate score within a class. For example, the cumulative frequency table in 
Table 8 shows that 10 students have scored below 39.5 and 46 have scored above 
29.5, but it fails to give the number of students scoring below or above 35 which is an 
intermediate score in class 29.5-39.5. We may further need, for example, the number 
of students in various grades, say A, B or C where grade A is given to a score of 65 or 
more, B for above 45 and below 65 and C for the rest. Though, this sort of information 
could be obtained by using arithmetic interpolation between class limits, yet it can be 
obtained from ogives with less effort and sufficient accuracy. Ogives are also drawn for 
determining median, a measure of central tendency, as will be discussed in the First 
chapter. There are two forms of ogive depending on the types of cumulative 
frequencies, namely— 


Less than ‘Ogive’: In the construction of ‘ogives’ also, we use true class limits. In 
the formation of less than ‘ogive’, upper true limits of the classes are plotted along 
the horizontal axis and the corresponding less than cumulative frequencies are 
marked along the vertical axis. The points so obtained are then joined by a free 
hand smooth curve to get less than ‘ogive’ as given in Fig. 3.19, for the cumulative 
frequency distribution in Table. 


The ogive in Fig. 3.19 can now be used to get number of students scoring below a 
given score. For example suppose the number of students scoring below 35 is to be 
estimated. Then 35 is first marked on horizontal axis as a point say A and a vertical 
line drawn through this point cuts the ogive at some point, say B. This cutting 
point is then projected on vertical axis to a point C. The point C, when measured 
on vertical axis, gives the required number of students as 7. 
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Under True Class Limits Ogive Less than, 
Cumulative Frequency 


Fig. 3.19 


(b) More than ‘Ogive’: For more than ‘ogive’ true lower limits of the classes are 
plotted along the horizontal axis and the corresponding more than cumulative 
frequencies are marked along the vertical axis. The points are then joined by a 
smooth free hand curve to give more than ‘ogive’ as shown in Fig. 3.20. The 
cumulative frequency distribution in Table 8 has been used for the purpose. 
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Ogive More than; Cumulative Frequency 
Fig. 3.20 
More than ‘ogive’ in Fig. 3.20, can now be readily used to find the number or 


percentage of students scoring above a given score. For example, in estimating the 
number of students scoring above 45, we first mark 45 on the horizontal axis as A. 
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Now, a vertical line through this point cuts the ogive at point B. This point B is 
then finally projected to vertical axis as point C. This point C, when measured on 
vertical axis, gives the required estimated number of students as 36. 


Example 15: Discuss the role of diagrams in presenting statistical data. 
Or 
Explain the utility of diagrammatic presentation of data. 
[B.C.A. (M.D.U.) Rohtak 2010, 2009, 2005, 2003; 
B.C.A. (Lucknow) 2010; B.C.A. (Rohilkhand) 2008, 2007] 


Solution: In order to provide an attractive form to the classified data, these are 
presented as tables, diagrams and graphs. Although the complex data are presented in an 
easy form through rows and columns under tabulation, yet these fail to attract the people 
because a special technical knowledge is required to understand them. Moreover, it is the 
best medium of comparison between two or more items, Data can be presented in 
diagrammatic form only if these are also available in tabular form. 


Example 16: Explain the rules, types and limitations of presentation of data. 
[B.C.A. (Lucknow) 2005, 2007; B.C.A. (Agra) 2002, 2004] 


Solution: The following rules must be followed while making a diagrammatic 
presentation of data, so that, it may made attractive: 


l. Suitable Title: A suitable title is necessary to make the diagram attractive. The 
title must be brief and clear, so that, the objective of diagram may become clear 
Just by having a view of it. 


2. Size of Diagram: The size of diagram must also be taken care of while presenting 
the data in the form of a diagram. The size of diagram should be in accordance with 
that of the paper. 


3. Use of Colours and Sings: Various colours and sings must be used in order to 
provide an attractive form to the diagram. A symbol chart should be drawn on the 
right side of the diagram. 


4. Scale: An appropriate scale should be determined before presenting data in the 
form of diagrams. The scale must be neither very big nor very short. 


J: Limited use of Words and Numericals: Least possible use of words and 
numerical must be made while drawing diagrams. 


6. Types of Diagrams: Diagrams are of various types like diagram, pie diagram, 
rectangular diagram etc. it is not necessary that every diagram can be used in each and 
every situation. So, it is necessary to select the appropriate diagram well in advance. 


7. Presentation of Importants Facts: Use of diagrams is an attractive method of 
presentation of important facts. With their help, the important facts can be 
presented in an attractive form by making use of symbols, colours etc. so that these 
facts may be easily conveyed to the public. 
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8. Use of Graph Paper: Graph papers can also be used while presenting data in the 
form of diagrams. The diagrams can be given more attractive and simpler form 
through the graphs. 

9. Clarity: While presenting diagrams. Special attention should be paid to their 
clarity. Much information must be include in a single diagram as it will be difficult 
to understand such a diagram. 

Thus diagrams can be made more attractive by paying attention to the above given 
facts. 

3.1.2.2 Types of Diagrams 


Many diagrams are used to present data. Their explanation is as follows: 


l. 


One Dimensional Diagram: These are also called ‘Bar Diagrams’. These 
diagrams are widely used. In these diagrams, are widely used. In these diagrams, 
the height is given more importance than the breadth or thickness of the diagram. 
Under it, the data are presented in the form of bars or rectangles. Any person can 
understand these diagrams easily. One dimensional or bar diagrams may be of the 
following types: 


(i) Line Diagram: When the data are also presented through simple lines only, 
then these are called line diagrams. 


(ii) Simple Bar Diagram: These are also prepared like line diagrams. The 
difference between the two is that while preparing it, much attention is 
provided to breadth in order to make the diagram more attractive. 


These are used to demonstrate only a single item. 


(iii) Multiple Bar Diagram: When two or more variables are depicted in a single 
diagram, then it is known as multiple bar diagram. These diagram have a 
great role to play in Comparative study. 


(iv) Sub-Divided Bar Diagram: Sub-divided bar diagrams are used in a case 
when the sub-parts of two or more variables are also given. Under it, all bar 
diagrams are divided in different parts and various colours and signs are used 
to show the difference. 


(v) Percentage Bar Diagram: In this method, first of all, the values of variables 
are depicted in length of each variable is assumed to below. Every part is 
presented in different colours or symbols. 


(vi) Deviation Bar Diagram: These are also called “Dio-dimensional Bar 
Diagrams’. These diagrams are used to show the deviations with reference 
to time and situation. Deviation is divided into two parts; (i) Positive 
Deviation and (ii) Negative Deviation. Positive deviation are shown above 
x-axis and Negative deviation are shown below x-axis. 
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2. 


Two Dimensional Diagrams: The length and breadth are paid special attention 
in these diagram. The bars represent the data. These diagrams have the following 
forms: 


(i) Rectangular Diagram: Both the length and breadth are paid special 
attention in these diagrams. These are also known as ‘Area Diagrams’. 


(ii) Percentage Rectangular Diagram: Under it firstly, the items are depicted 
in percentage form and then, in the form of rectangle. The length of bar is 


assumed to be 100. There after, the rectangular diagram is prepared and 
sub-divided. 


(iii) Square Diagram: The length and breadth of such diagrams are equal. These 
diagrams are used mainly in such situations when the number of items is 
more and it is not possible to present them in the form of bar diagrams. 


(iv) Circle or Pie Diagram: Under it, a circle is divided into different part and 
the items are presented in these parts in percentage form. 


Three-Dimensional Diagrams: These diagrams are used when the number of 
item is very large and it is not possible to present them even through Pie or Square 
diagram. In these diagrams, each of length, breadth and thickness has special 
importance. 


Thus, it is clear that data can be presented through diagrams in many ways. An 
appropriate diagram should be selected on the basis of nature of the data. 


Limitations of Diagrammatic Presentation 


The description of limitations of diagrammatic presentation is as follows: 


(i) Misuse of Diagrams: It is possible to misuse the information presented 
diagrammatically because if the diagrams are not prepared properly, then 
they will not present the right information. 


(ii) Difficulty in Analysis: Data are not presented in diagrammatical form 
when it is essential to make their comparative study but the 
diagrammatically presented data can not be analysed. It is very difficult to 
make a final conclusion on their basis. 


(iii) Not Useful for Researcher: The diagrammatic presentation of data is not 
useful for the researchers because it does not provide any help to the 
researchers in the analysis of data. 


(iv) Helpful in Estimation Only: The complete information can not be 
obtained by presenting the data diagrammatically. It is helpful only in 
providing an estimation of data. On the contrary, complete information can 
be easily obtained if the data is presented in a tabular form. 


(v) Not Helpful in Presenting Much Information: It is not possible to make 
use of diagrams when a lot of information is to be presented. Only the simple 
and small data can be presented by using diagrams. 
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(vi) 


(vii) 


Presentation of Approximate Values: The numerical data can not be 
presented through diagrams, as a result of which, these lack accuracy and it 
also becomes difficult to obtain useful information. Only an approximate 
value of data can be obtained through diagrams. 


Complexity of Diagrams: Some diagrams are very complex like 
two-dimensional and three dimensional diagrams. Due to complexity, these 
diagrams can not be easily understood by public. A special skill is required to 
understand them. 


(viii) More Difference in Data: If there is a lot of difference in data, then it is 


almost impossible to present them through diagrams. For example, if the 
data are in a ratio of 6 : 8000. Then their scale con not be prepared. 


Thus, it is clear from the above that various limitations of diagrams are also 
there along with their uses. 


Example 17: What is a graphic presentation of Data ? Discuss its merits and demerits. 


[B.C.A. (A.M.U.) 2009] 


Solution: Graphic presentation is also an attractive method of presenting the data. 


Although the data related to time is presented though diagrams. Yet in order to present 


frequency distribution, time series etc. mainly graphs are used. Graph papers are used for 


it. The items depicted on graph paper are simple, clear, accurate and more attractive. 


+ 


Merits of Utility of Graphic Presentation 


The explanation of main merits of graphic presentation is as follows: 


(i) 


(ii) 


(iii) 


(vi) 


(v) 


(vi) 


Simple and Attractive: Data can be presented in a simple and attractive 
manner through graphs, as a result of which, these can be easily understood 
by public. 


Helpful in Estimation: With their help, even the future data can be easily 
estimated, For example with the help of data about sales from 1982 to 1992. 
The sales in the year 2000 can be easily estimated. 


Helpful in Comparison: Graphs are also helpful in making comparison in 
the items related to various time. 


More Memorable: Due to large number of data, it is very difficult to 
remember them, but it becomes easy if the data are presented graphically 
because graphs have a longer image on people mind. 


Estimation of Correlation: With their help, the correlation between two 
data can be estimated by presenting them on a single graph. 


Knowledge of Frequency Distribution: The knowledge of the nature of 
frequency distribution can be easily obtain with their help. Frequency 
distribution can be depicted in various forms like histogram, frequency 
polygon, simple frequency curve, cumulative frequency curve etc. 
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Demerits of Graphic Presentation 


(i) 


(ii) 


(iii) 


(vi) 


(v) 


(vi) 


Lack of Proof: The graphically presented data can not be used as proof 
because these lack accuracy. Their only objective is to proses the complex 
data in a simpler form, so that, people may understand them easily. 


Difficult to Understand: The data presented through graph is difficult to 
understand as only the person having technical knowledge can understand 
then so, it is difficult for ordinary man to understand these graphs. 


Difficult in Analysis: The graphically presented data can not be used as 
proof because these lack accuracy. Their only objective is to present the 
complex data in a simple form, so that people may understand them easily. 


Wrong Conclusions: The conclusions drawn from graphs are not accurate 
and generally, wrong indications are provided by them because a little 
change in criterion may lead to a great change in line. 


Provides Inadequate Information: The graphic presentation of data 
provides inadequate information only. 


Misuse of Graphs: There may be various misused of graphic presentation 
of data because if the graphs are not prepared properly, then these will not 
provide right information. 


Example 18: Represent the following data by mean of Histogram: 


Weekly Wages 10-15 | 15-20 | 20-25 | 25-30 | 30-40 | 40-60 | 60-80 


[B.C.A. (Avadh) 2010, 2008] 


Solution: Since the class intervals are unequal frequency are to be adjusted. 


Computations of New Frequency of Frequency Density 


Ce | e e e 
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Fig. 3.21: Histogram 


Example 19: Present the following data diagrammatically: 


Firm A Firm B Firm C 
> < > 


Raw Material R) 
Labour R) 
Misc Expenses R) 


Total Cost R) 
Sales Price R) 


[B.C.A. (Purvanchal) 2008; B.C.A. (A.M.U) 2005] 


Selling prize on % age basis 


Raw material (7) 
Labour (®) 
Misc expenses (®) 


Total cost (®) 


Sales price (2) 


Profit or Loss 
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Fig. 3.22: Bar Diagram Firm C 


3.2 Graphs of the Time Series 


A series of observations recorded over time is called a time series. For example data related to 
the population of a country recorded at ten-yearly intervals, annual production of crops, 
the wholesale price index over a number of months, the daily closing price of a share on 
the stock exchange. In fact, the data related with business and economic activities in general, 
recorded over time gives rise to a time series. Obviously the variations in a time series are 
most important to an economist. The variations in the time series can also be presented 
graphically. These graphs are called line graphs. 


Steps in the construction of line graphs. 


1. x-axis is used as time scale, i.e. year, months or days are marked along x-axis. 
2. Variations in the variable are to be marked along y-axis on a suitable scale. 
3. The marked points are joined by straight lines to show the pattern of variation. 


The following example will clarify: 


Example 20: Represent the following data by a suitable line graph. (Rate per 1000 population 


per annum) 


ee eae Zz 
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Solution: The line graph is shown in Fig 3.23: 


Birth Rate 


AY AY | 


ugi thy i 


Rat 


Rate per Thousand 


1911-21 1921-31 1931-41 1941-51 1951-61 1961-71 1971-81 
Line Graph Showing Birth, Death and Growth Rate 1911-81 


Fig. 3.23: Line Graph 


Example 21: Represent the given data by a histogram. 


[Class | 30 35 | 35-40 | 40-45 | 45-50 | 50-55 | 55-60 | 60-65 


Pwo [tts e e e 


[B.C.A. (Bundelkhand) 2006] 


Solution: 37 


YYW). 


YW 


We 


Roll Number of Students 


Fig. 3.24: Histogram 


Example 22: Draw a histogram from the following data: 


1-10 11-20 21-30 31-40 41-50 


mio [ote |» |e |< 


Solution: From the data it is clear that it is an inclusive series and therefore, it will be 


converted into an exclusive series. 
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Fig. 3.25: Histogram 


Example 23: Present the following frequency distribution by a histogram: 


Solution: In this example mid-values are given and these mid-values are to be 


converted into class intervals by the method stated below: 


Get the difference of second and first mid-values i.e., 7.5 -2.5 = 5 
Divide the difference by 2 i.e., G) =2.5. The quotient 2.5 is subtracted and added to the 


first mid-value i.e., 2.5 -2.5 =O and 2.5 + 2.5 =5. Thus, class interval for first mid value 
is 0-5. Likewise, other class intervals are obtained for other mid-values, which are as 


follows: 


Class Intervals 10-15 15-20 20-25 
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Fig. 3.26: Histogram 


Example 24: Represent the following data by a histogram: 


ptosis | 6 | 


C o O SO 


Solution: The class intervals are unequal and therefore, frequencies are to be adjusted 


by finding an adjustment factor. Since the lowest class interval is 5, the frequencies of 
class interval 130-140 shall be divided by 2 because the class interval is double. 
Similarly, the frequencies of class interval 140-160 and 160-180 shall be divided by 4 
each. In other words the adjustment factor for class interval 110-115, 115-120, 
120-125 and 125-130 is 1 each, while for 130-140, it is 2 and for 140-160 and 
160-180, it is 4 each. This has been summarised in the following table: 


= | A A 
i ee ee a ees as 


a | 
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Weekly Wages (7) 
Fig. 3.27: Histogram 


Example 25: Draw a frequency polygon from the following data: 


Eregueng | 6 | 12 | w | is | 8 | 4 | 
[B.C.A. (Meerut) 2004] 


Solution: We find the mid value of each class and record them as below: 
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Fig. 3.28: Frequency Polygon 
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Example 26: Draw a histogram, frequency polygon and frequency curve from the following data: 


Class Interval 10-20 | 20-30 | 30-40 | 40-50 | 50-60 60-70 


reno [stot etafel? 


[B.C.A. (Meerut) 2010] 


Solution: 
22 
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Fig. 3.29: Histogram, Frequency Polygon and Frequency Curve 


Example 27: 


Find less than and more than cumulative frequencies and draw ogives. [B.C.A. (Meerut) 2012] 


Frequency Distribution and Graphical Representation C 83 > 


Solution: If we cumulate the frequencies of the above distribution, we obtain a 


cumulative frequency distribution as given below: 


Like a frequency curve of a frequency distribution, we can draw a cumulative 
frequency curve or ogive of a cumulative frequency distribution. When a series is 
cumulated upward it is called less than ogive and in this case ogive curve will never 
fall. Similarly, if a series is cumulated downward, it is called more than ogive and it 


will be a declining curve. 
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Fig. 3.30: Less than Ogive 3.31: More than Ogive 
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Fig. 3.32: Less than and More than Ogive 


Example 28: Draw histogram, frequency polygon and cumulative frequency curve from the 
following data: 


[B.C.A. (Kanpur) 2006] 


Solution: 


Frequency 


10 15 0 25 30 
Classes 


Fig. 3.33: Histogram and Frequency Polygon 
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Fig. 3.34: Commutative Frequency Curve (Ogive) 


Example 29: Prepare an ogive from the following data: 
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[B.C.A. (Bundelkhand) 2008] 
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Solution: 


Fig. 3.35: Cumulative Frequency Curve (ogive) 


Example 30: Following is the age distribution of a group of students. Draw the cumulative 
frequency curve of less than type and hence obtain the median value: 


[B.C.A. (Meerut) 2010] 


Solution: From the given table, we may prepare the (less than type) cumulative 
frequency table as shown on the next page. 


_— 
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On a graph paper, we take the scale 

Along the x-axis: 5 small div. = 1. 

Along the y-axis: 1 small div. = 10. 

And, plot the points A(5,36), B(6,78), C(7,130), D(8,190), E(9,258), F(10,342), 
G(11,438), H (12,520), I (13,586), J (14,634), K(15,684), and L(16,700). 

We join these points successively with a freehand to get the cumulative frequency curve 
or an ogive. 


Here, N =700 = =350 


109 Scale: o the x-axis, 5 small div. = 1. 


Along the y-axis, 1 small div. = 10. 


600 
500 
400 
(0, 350) P H---------------------- 
300 
200 
100 


5 6 7 8 910 1112131415 16 X 


Fig. 3.36: Cumulative Frequency Curve 
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Take a point P (0,350) on the y-axis and draw PQ || x-axis, meeting the curve at Q. Draw 
QM l x-axis, intersecting x-axis at M. 

Then, OM =10 units. 
Hence, median = 10. 


Example 31: For the following frequency distribution, draw a cumulative frequency curve of more 


than type and hence obtain the median value: 


Class Interval 10 — 20 | 20 - 30 | 30 — 40 | 40 - 50 | 50-60 | 60 - 70 


Feens | 5 | s|» | 2s [7 fom | 9 | 
[B.C.A. (Avadh) 2011] 


Solution: From the given table, we may prepare the ‘more than’ series as shown below: 


EE 
O ew [a 


Along the x-axis,10 small div. = 5. 
Along the y-axis, lsmall div. =1. 


Scale: | 


Now plot the points A (5,100), B (10,95), C (20,80), D (30,60), E (40,37), F (50,20) and 
G (60,9). 


Joint AB, BC, CD, DE, EF and FG with a freehand to get the required curve, as shown 


below: 


Here, N =100 >^ =50. 
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Along the x-axis, 10 small div. = 5 
Scale: 7 
Along the y-axis, 1 small div. = 1 


I 
I 
I 
I 
I 
l 
I 
I 
I 
I 
I 
I 
' 
I 


M 
a.. 
5 10 15 20 25 30 35 40 45 50 55 60 X 
Fig. 3.37: Cumulative Frequency Curve More Than Type 


From P (0,50) draw PQ || x-axis, meeting the curve at Q. Draw QM LOX, meeting x-axis 
at M. Clearly, OM =35 units. 


Hence, median = 35. 


Example 32: The following table gives production yield per hectare of wheat of 100 farms of a 
village: 


Mam |a| e | | o|o] 


Change the distribution to a ‘more than type’ distribution and draw its ogive. 
[M.D.U. (Rohtak) 2011] 


Solution: We may prepare the more than series as shown below: 


Along the x-axis,1 small div. = 1. 
Along the y-axis, l small div. =1. 


Scale: | 


On the graph paper, we plot the points A (40,100), B (45,96), C(50,90), D (55,74), 
E(60,54) and F(65,24). 


= 
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Join AB, BC, CD, DE and EF with a free hand to get a “More Than Ogive’. 


Y 
100 


80 


40 More Than Ogive 


No. of farms ————> 
a x 
-= © 


< n 
xo 40 45 50 55 60 65 x 


Y’ Production yield (kg/hectare) ——> 
Fig. 3.29: More Than Type Ogive 


Example 33: During the medical checkup of 35 students of a class their weights were recorded as 
follows: 


Weight (in kg.) | 38-40] 40-42 | 42 -44 | 44-46 | 46 -48 | 48 -50 | 50-52 
Draw a less than type and a more than type ogive from the given data. Hence obtain the median 
weight from the graph. [B.C.A (Agra) 2008] 


Solution: 
(i) Less than series: 


We may prepare the less than series as under: 


aa a E E 
| tessthana2 | O O 5 O 
| Lesthans4 | 9 O 


Less than 46 
Less than 48 
Less than 50 


= 
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Along the x-axis,5 small div. = 1kg. 


Scale: ie the y-axis,10 small div. =5 kg. 


We plot the points A (40,3), B (42,5), C(44,9), D (46,14), E (48,28), F (50,32) and 
G (52,35). 
Join AB, BC, CD, DE, EF and FG with a free hand to get the curve representing ‘less 
then series’. 


(ii) |More than series: 


We may prepare the more than series as under: 


On the same graph-paper as above we plot the points P (38,35), Q (40,32), 
R (42,30), S (44,26), T (46,21), U (48,7) and V (50,3). 


Join PQ,QR,RS,ST,TU and UV with a free hand to get the curve representing 


c ra? 
more than series’. 


38 40 42 44 46 48 50 52 
Weight in kg ——> 
Along the x-axis, 5 small div. = 1 kg. 
Along the y-axis, 10 small div. = 5 students. 


Fig. 3.39: Ogive 


Y’ Scale: l 


The two curves intersect at the point L. Draw LM LOX. 
Median weight = OM = 46.5 kg. 
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Example 34: The table given below shows the frequency distribution of the scores obtained by 200 
candidates in a B.C.A. entrance examination. 


[score |200 250} 250-300 | 300-350 | 350-400 | 400-450 | 450-500 | 500-550 | 550-600 
40 


No. of 
Mena | | > | | 2 | | fo fos 


Draw cumulative frequency curves by using (i) ‘less than series’ and (ii) ‘more than series’. 
Hence, find the median. [B.C.A. (Lucknow) 2010, 2007] 


Solution: 
(i) Less than Series: 


We may prepare the less than series as under: 


Along the x-axis, 1 small div. = 5. 
Along the y-axis, 1 small div. = 2. 


Taking the scale: | 


We plot the point A (250,30), B (300,45), C (350,90), D (400,110), E (450,135), 
F (500,175), G (550,185) and H (600,200) on a graph-paper. 


Join AB, BC, CD, DE, EF, FG and GH with a free hand to get the curve representing 
‘less than series’. 
(ii) |More than series: 


We may prepare the more than series as under: 
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Now, on the same graph-paper as above, we plot the points P (200,200), Q(250,170), 
R (300,155), $(350,100), T (400,90), U (450,65), V(500,25), G(550,15). 


Join PQ, QR, RS, ST, TU, UV and VW with a free hand to get the curve 
representing ‘more than series’. 


Along the x-axis, l small div. =5. 


Scale: {Aire the y-axis, I small div. =2. 


Y IP 


20 ' 
M; 

< . > 
xO yr? 100 150 200 250 300 350 400 450 500 550 600 X 


Fig. 3.40: Cumulative Frequency Curves 


The two curves intersect at the point L. Draw LM LOX, cutting x-axis at M. 
Clearly, M represents 375. 


Hence, median = 375. 


| Problem Set N 


1. Explain the method of constructing frequency polygon. 


Dis An opinion poll in an office showed that 40% were congressites 10% belonged to 
B.J.P. 41% with communists and 9% none of the above three. Draw a pie diagram. 


3: A survey was taken among 100 students of a certain science college to find the 


most popular subject and the result was shown below: 


Present this information by (i) Bar-diagram (ii) Pie-diagram. 
[B.C.A. (I.G.N.O.U) 2012] 


10. 
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Tee 


College students of Meerut district are classified according to sex and living 
condition for 2007-08 as follows: 


Construct a bar-diagram to display simultaneously a comparison between the total 
numbers of students living in each type of accommodation and sex composition of 
each total. 


Draw the histogram and frequency polygon for the following data: 


Write short notes on the following: 


(a) Histogram (b) Frequency polygon 
(c) Frequency curve and (d)  Ogive 
[B.C.A. (Garhwal) 2011; B.C.A. (Bundelkhand) 2010, 2007, 2005] 


Is there any relation between frequency polygon and frequency curve ? 


Draw the ogive and frequency polygon for the following distribution: 


Class-Interval meg 16—24 | 24—32 | 32—40 40—48 


Use rectangle diagram to present the following data: 


a 


Draw a cumulative frequency curve (less than type) for the following data and find 


the median from it: [B.C.A. (I.P.U., Delhi) 2012; IL.G.N.O.U. 2010] 


noo i 


Frequency Distribution and Graphical Representation N 95 > 


11. Following is the distribution of marks of 70 students in a periodical test: 


Draw a cumulative frequency curve for the above data and find the median. 
[B.C.A. (Purvanchal) 2005] 


12. The following table gives the heights (in meters) of 360 trees: 


eee ee 
| Testhanl4m | 45 


From the above data, draw an ogive and find the median. 
[B.C.A. (Kashi Vidhyapeeth) 2011] 


Hint: Along the x-axis, take 10 small div.=7 m. 
5 Along the y-axis, takel small div.=5 trees. 


13. From the following frequency distribution, prepare the “less than ogive’. 


E E A 
C oa a 


Also, find the median. [B.C.A. (Meerut) 2012] 


14. 


15. 


16. 
17. 
18. 
19. 


From the following frequency distribution, prepare the ‘more than ogive’: 


ai a 
450-300 Ps 
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Also find the median. [B.C.A. (Rohilkhand) 2009] 


The marks obtained by 100 students of a class in an examination are given below: 


a ee ae 
EE he a 
Oo Bo | 8 O 


Draw cumulative frequency curves by using (i) ‘less than series’ and (ii) ‘more than 


series’. 


Hence, find the median. [B.C.A. (Kashi Vidhyapeeth) 2010] 


Use rectangle diagram on percentage basis to represent the data in Problem No. 8. 


Represent the data in Problem No. 8 by using a multiple Pie-diagram. 
Explain the method of constructing histogram. 


Explain the method of constructing ogive. 


uun 


Chapter 


Measure of Central 
Tendency 


4.1 Averages 
Objectives of Averages 

Į; Averages help to get a representative value of the entire mass of data 
2. Averages facilitate comparison 


3. Averages help in decision making 


4.1.1 Characteristics of a Good Average 


As the average is a single representative value of the mass of complex data, it must have 


the following properties: 

(i) It should be rigidly defined 

(ii) It should be simple to calculate and easy to comprehend 

(iii) Its calculation should be based on all the observations 

(iv) It should be capable of further algebraic treatment 

(v) It should not be affected by extreme values of the observations 


(vi) It should be least affected by fluctuations of sampling. 
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4.2 Measures of Central Tendency 


In statistics, various measures of central tendency or averages are studied. Three 
important measures of central tendency are : 


1. Arithmetic Mean (A.M.) or Mean 


2. Median 
3: Mode 
4.2.1 Arithmetic Mean 


Arithmetic mean, also called ‘average’ in general sense, is the most popular and widely 
used measure of central tendency. Arithmetic mean is a number obtained by dividing the sum 
of given observations by their number. Arithmetic mean is of two types: 


l. Simple Arithmetic Mean 


2: Weighted Arithmetic Mean 


4.2.1.1 Simple Arithmetic Mean 


l. Calculation of Simple Arithmetic Mean: In classification and tabulation of data, 
we observed that the values of the variable or observation can be put in the form 
of any of the following statistical series, namely : 


(i) Individual Series or Ungrouped Data 
(ii) Discrete Frequency Distribution 


(iii) Continuous Frequency Distribution 


(i) Calculation of Arithmetic Mean in Individual Series: Following two 
methods are used for calculating arithmetic mean of an individual series: 


(a) Direct Method 
(b) Short-cut Method 


(a) Direct Method: The process of calculating arithmetic mean or mean in the 
case of an individual series is very simple. If the respective values of N 


observations recorded on a variable X are Xi, X ,..., Xy, then the mean, 
denoted by X, is defined by 
N 
X; 
= X,+ Xo 4+...4X j= 
och See il oley (1) 
N N N 


Here, EX = (X + Xo +... + Xn) is the sum of all the N values of the variable 
X. The formula in (1) can also be written as 


LX = NX (2) 


It is clear from formula (2) that if any two of the three values of the formula 
are known, the third one can be calculated. 


1 ` 
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Solved Examples 


Example 1: Calculate the arithmetic mean of the following marks in statistics obtained by ten 
students in a class. 


no ee ee 


Solution: 


Calculation of Mean using Direct Method 


rs a E 


l. Calculate £X by adding all the observations. 


2. Divide this total by N, the number of observations. In symbols, 
x-1 sx 
N 


Using formula (1), the mean becomes 


eee iy marks 
N 10 


(b) Short-Cut Method: The direct method of calculating arithmetic means is 
used when the number of items in the series is relatively less. If items are 
more and figures are large enough, the computation of mean becomes 
difficult. This difficulty can be solved by using the short-cut method. Under 
this method an assumed mean is taken as the basis of calculation. The 
assumed mean is usually chosen to be a neat round number in the middle of the range 
of the given observations, so that deviations can be easily obtained by subtraction. 
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Then, a short-cut formula, based on deviations from assumed mean, for 
calculating arithmetic mean is: 


X=At¢—= ..(3) 
N 
Here, X =the arithmetic mean, 
A =the assumed mean, 


d=(X-—A), the deviation of each value of the 
variable from the assumed mean A. 


Xd = the sum of deviations. 


Example 2: Use short-cut method to calculate arithmetic mean for the data in statistics obtain by 


ten students. 


ee e eee ee 


Solution: 


Computation of Mean by Short Cut Method 


Marks (X) Deviation (X - 18) 


In this example, the data ranges from 11 to 25. Therefore, 18, a neat round value in the 
middle of 11 and 25, may be taken as assumed mean, i.e. A =18. The deviations and sum 
of deviations needed in formula (3) may be calculated from the table given. 
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Using formula (3), the arithmetic mean will be 


-AŽ 
N 


=18+ (=) =17 marks. 
10 


It can be seen that the arithmetic mean obtained by the direct and the short-cut method is the same. 


Example 3: Show that the arithmetic mean of the series 


ont = 1) 


l, 2, 22, 29, 24... , 2” is 
(n+) 


[B.C.A. (Meerut) 2001] 
Solution: 


F : : sum of values of items 
Required arithmetic mean = ——_—_—_———_. 


number of items 
14242? eo? 4, eo" 
E (n+l) 

pet = 
_ 2-) 
— (n+) 


a -1) 
(n+ 1) 


Note: Sum of G.P. of nth term 
a(l—r” 
ae se 
l-r) 
where a = first term 
r = common ratio. 

Example 4: The mean of n numbers of a series is X and the sum of first (n — 1) numbers is à. Find 
the value of the last number. [B.C.A. (Delhi) 2005] 
Solution: The sum of the first n numbers 

= mean x (number of items) 

=¥Xn=nXx 
Also given sum of first (n — 1) numbers = A 


Required last number =sum of n numbers — sum of first (7—1) numbers 


=nx -À 
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(ii) Calculation of Arithmetic Mean in a Discrete Frequency Distribution 


The following three methods are used for computing arithmetic mean in a 
discrete frequency distribution: 


(a) Direct Method 
(b) Short-Cut Method 
(c) Step-Deviation Method 


(a) Direct Method: In discrete frequency distribution we multiply the values of 
the variable (X) by their respective frequencies (f) and get the sum of the 
products (£ fX). The sum of the products is then divided by the total of the 
frequencies, i.e. Lf =N. Thus, according to this method, the formula for 
calculating arithmetic mean becomes: 


ge A 


A (A) 


Here, £ fX =the sum of the products of observations with their respective 
frequencies. 


xf =N =the sum of the frequencies. 


Example 5: Following table gives the wages paid to 125 workers in a factory. Calculate the 


arithmetic mean of the wages: 


Wage R) 200 210 220 230 240 230 260 


[B.C.A. (Meerut) 2009] 


Solution: 
Computation of A.M. (Direct Method) 


Wages (X) No. of Workers (f) JX 


1040 


‘leas 
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Using the formula (4), the arithmetic mean will be 


y- ZfX _ 28490 
N 125 


7227.92 


(b) Short-Cut Method: As observed that short-cut method does not necessarily 
reduce the computational work in the case of individual series or ungrouped 
data. However, for grouped data, this method results in considerable saving 
of time. According to this method, the formula for calculating arithmetic 


mean is 
= xX fd 
X=A+t 2m (3) 
N 
Here, A = the assumed mean, 


d =(X — A), the deviation of X from the assumed mean A. 


È fd = the sum of the products of frequencies and their respective 
deviations. 


Example 6: What is the arithmetic mean of the following ? 


oa i 


[B.C.A. (Delhi) 2006, 2001] 


Solution: 


zf 


Required mean = SF 


eg 1+ ”C .2 +... "Con 
"Co + "C+ "Cy +... "Cn 


.3+..l.n 


z n(n—1) 94 n(n-l)(n-2) 
2! 3! 
sum of all binomial coefficient in the expansion (1 + x)” 


n|” ones n IC ql l ese ob a i | 
= gn 


_ n. [sum of all binomial coefficient in the expansion (1 + x) 4 


gn 


n.2"-1 p 
on 2 
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Example 7: Ifthe arithmetic average of data given below be 165 rupees, find the missing term. 


[B.C.A. (Rohilkhand) 2002] 


Solution: Here, taking z as the missing terms we have 


| 3000 


3000 


a ee ae 


Now, from Arithmetic average = = 
4 165 = 10700 + 10z 
80 
=> 165 x 80 =10700 + 10z 
=> 10z =13200 —10700 
10z =2500 

=> z= 2300 — =250 

I0 


Example 8: Calculate the arithmetic mean for the data in the following table given the wages paid 
to 125 workers in a factory by using shortcut method: 


Solution: In this case, assumed mean A is taken as 230 because it is neatly at the centre 


of the range of the variable X. The other needed calculations are given in the table. 


Measure of Central Tendency 


Computation of Arithmetic Mean (Short-Cut Method) 


Using formula (5), the arithmetic mean will be 


= fd =-260 
z fd 


X =A+——=230 + (SE) -230-208-2 227.92 
N 125 


(c) Step-Deviation Method: In short-cut method of calculating mean, the 
deviations taken from an assumed mean generally have a common factor. In 
continuous frequency distributions this common factor is nothing but the 
uniform class-interval. The computational work in short-cut method can be 
further simplified if the deviations obtained are divided by this common 
factor. The deviations divided by the common factor are called 
step-deviations. According to this method, the arithmetic mean is 
calculated by the formula: 


x fd’ 
X=Ath i (6) 
N 
Here, h =the common factor in deviations, d = (X — A) 
X-A 
t= 7 = j , the step-deviations. 
h 1 


xX fd’ = the sum of the products of the step-deviations and their 
respective frequencies. 


Example 9: Use step-deviation method for calculating arithmetic mean for the data in following 
table gives the wages paid to 125 workers in a factory: 
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Solution: While using short-cut method for computing mean in example 8, the 
deviations (d) have a common factor which is A = 10. Thus, the volume of the deviations 
may be further reduced if we divide these deviations by this common factor. The reduced 
deviations are called step-deviations and denoted by d’. The other needed calculations 


are given in the table below: 


Computation of Arithmetic Mean (Step-Deviation Method) 


Deviations Step-deviations 
Wages X | Workers f | d=(X-230) = 


Using formula (6), the arithmetic mean becomes 


Zfd’ 26 


X=A+th =230 +10 (- 23) -230 -2.08 = (@)227.92 


The value of X obtained by all the three methods is, of course, the same. 


(iii) Calculation of Arithmetic Mean in a Continuous Frequency 
Distribution: In continuous frequency distribution, the procedure of 
computing arithmetic mean is the same as in the case of discrete frequency 
distribution. The only difference is that in continuous frequency 
distributions the frequencies within each class are assumed to be distributed 
uniformly over its range. With this assumption, each class is then represented 
by its mid-point X. Using these mid-points of the classes and the corresponding 
frequencies, arithmetic mean for the continuous frequency distribution can be calculated 
by using any of the methods used in a discrete series. The following examples will 
clarify the procedure. 
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Example 10: Use direct method to calculate arithmetic mean from the following data: 


[Cus | 20-25 25-30 | 30-35 | 35-40 | 40-45 | 45-50 | 50-55 


mor Lo e e e ee 


Solution: For computing arithmetic mean for continuous series, we first obtain 
mid-points (X) for all the classes. After determining these mid-points, formula (4) is used 


for computing X. The calculations needed are shown in the table: 


Computation of Mean of a Continuous Series (Direct Method) 


Thus, the arithmetic mean will be: 


z- Z/X _2515 
N 7 


— =35.93 


Example 11: Use short-cut method for calculating mean for the data given by: 


[Cas | 20-25 25—30 | 30—35 | 35—40 | 40—45 | 45—50 | 50—55 


me e e e e e e 
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Using formula (5), the arithmetic mean can be obtained as 
x fd 
=A+— zi =37.5 + +(- )- 37.5 -1.57 = 35.93 
N 70 
The average is 35.93. 
Example 12: Obtain arithmetic mean for the data in example 2 using step-deviation method. 


Solution: 
Calculation of Arithmetic Mean (Step-Deviation Method) 


Step- 
es ae mea Deviations, | Deviations 
Classes 
d =(X-37.5) r= d 


Le e ae oe at 
so] s [2% | 0 |» [0 
p ae e ae ee 


Here, A =37.5 and h=5 Applying formula (6), the value of X will be 


ee oe al =37.5 +5 (= =)- 37.5 -1.57 =35.93 
N 70 
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Example 13: Find the average marks of the students from the following data: 


Solution: In this example a ‘less than’ cumulative frequency distribution is given and 
for computing the average, this should first be converted to a simple frequency 
distribution as shown in the table: 


Computation of Mean 


on using step-deviation formula for computing mean, i.e. 


X=A+h Zfd =35 +10 x LA > =35 +15 4 =50.4 marks 
N 250 


Example 14: The following data relate to the marks of 70 students in statistics. Find the mean: 


Marks (More than) 


No. of Students 


Solution: In this example, a ‘More than’ cumulative frequency distribution is given. For 


computing mean, the given distribution is converted into a simple frequency distribution 
as shown in the table. 
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Computing Mean 


More | More than 20 | 20 | 20-30 | 70 = | 70-63 =7 | ii ew 


a TEE 
Crena fof Tater a o 
ooo | foo O ee 


X=A+h =L +10/ =2)- 55 -7.43 =47.537 Marks. 
N 70 


2 When Inclusive Classes are Given: Calculation of arithmetic mean remains 
unaffected even if the frequency distribution with inclusive classes is given. It is 
because of the fact that the mid values for the inclusive and the corresponding 
exclusive classes remain the same. The point will be clear from the following 
example. 


Example 15: Calculate arithmetic mean from the following distribution: 


o> ele ale 


[B.C.A. (Meerut) 2007] 


Solution: 
Calculation of Arithmetic Mean 


A Sas > -4t 5 
Classes Mid-Value X 


Pe T) e 


Here, A=44.5, h=10 


X=A+h Zù =44. 5 +10(= =)- 44.5 -2.75 =41.75 
N 80 


ee <a 
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3. Mathematical Properties of Arithmetic Mean: The arithmetic mean has the 
following important mathematical properties: 


(i) 


For Example: (i) Here, X= 


The sum of the deviations of a given set of observations from the arithmetic 


mean is equal to zero. Symbolically, this property can be written as 
x(X — X) =0 


Because of this property, mean is called the point of balance or the centre of 
gravity as the sum of positive deviations from the arithmetic mean is equal to the sum 
of the negative deviations. 

xX 200 


=— = 40 
N 5 


and the sum of deviations from arithmetic mean, i.e. 


X(X — X) =0 


The sum of squares of deviations of a set of observations from arithmetic mean is 
minimum. In other words, the sum of squared deviations from mean is less 
than the sum of the squared deviations from any other value. Symbolically, 


[B.C.A. (Meerut) 2004, 2003] 
SX - X? < XX - A’ 


For verification of the property, let us consider an example. 
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5 


Here, X= is 


Also, the sum of squared deviations from mean, i.e. E(X — X 2 =250 


Further, let us consider an arbitrary value, say A = 10. It is observed from the table that the 
sum of squared deviations from A = 10, i.e. XX - Ay = 375. Thus, 


E(X - X} =250 < E(X -10)? =375. 


(iii) If every value of the variable X is increased (or decreased) by a constant 
value, the arithmetic mean of the observations so obtained also increases (or 
decreases) by the same constant. 


Symbolically, 
Let, Y =X +b where bis a constant. 


LY =X(X+b)==X+ Nb 


==“ t+h orY=aXth 


For Example: Here 5 observations on the variable X are given. A new variable Y is 
obtained adding 2 (a constant) to each observation on X. Again, another variable Z has been 
defined by subtracting 3 from each observation on X. 
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Now, x= 2X = 30 =6 
N 5 
Y = 2 8 
N 5 
Z= A = 15 =3 
N 5 


On the other hand, using property 3 above, the values of Y and Z can also be obtained as 
Y=X+2=6+2=8 (Adding constant 2 to X) 


Z=X-3=6-3=3 (Subtracting constant 3 from X) 


(iv) If the values of the variable X are multiplied (or divided) by a constant, the 
arithmetic mean of the new observations can be obtained by multiplying (or 
dividing) the initial arithmetic mean by the same constant. Symbolically, 


Let, Y =kX,k being a constant 
LY = TkX =kxX 
ZY EX 

or =~ 
N N 

or Y =kX 


For Example: Suppose 2, 6, 8,4, 10 are observations on the variable X. A new variable Y 
is obtained by multiplying each observation on X by a constant 3. Also, another variable Z is 
obtained by dividing each observation on X by the constant 2. Then, the values of 
observations on Y and Z are shown in the table: 


5 

y-2Y 2% _186 
N 5 

and 727") 3% 
N 5 
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Alternatively, on using property 4 above, Y and Z can also be obtained as 


Y =3X =3 x6 =18 (on multiplying X by 3) 
Z=^-= s =3 (on dividing X by 2) 


(v) If the arithmetic means and the number of observations of two or more 
related groups are known, we can calculate the combined arithmetic mean of 
these groups. The combined mean formula for two related groups is as 


under: 
> _ NX, +N Xx: 
Xo =e aT) 
Nı + No 
Here, X12 = Combined mean of two group 


N; = No. of observations in first group 

Ns = No. of observations in second group 
Xi = mean of the first group 

X 


mean of the second group 
Similarly, for three related groups, the formula becomes 
= NX + No Xo + N3X3 
X123 E “M+ Ny + No 
l + No + N3 
The formula can also be extended for k-groups as 
NiX + No Xo te ae N; X, 


Xa = 
Teas Ni + No +...+ Ny 


..(9) 
Here the symbols have their usual meanings. 


Example 16: The mean marks of 60 students in section A is 40 and mean marks of 40 students in 
section B is 45. Find the combined mean of the 100 students in both the sections. 


Solution: Here, N; =60, Ns =40, X 40 and X =45. Using formula (7), the 


combined mean of all the 100 students will be 


=> NX +NəX%, 60x40+40x45 2400 +1800 
X19 SAALTA L SARAAN A AA a =42 marks. 

Ni + No 100 100 
Example 17: The mean wage of 100 workers in a factory running two shifts of 60 and 40 workers 
is 38. The mean wage of 60 workers in the morning shift is X 40. Find the mean wage of 40 workers 
in the evening shift. 


Solution: Here, N; = 60, No =40, X; =40 and X} =38. We are required to find the 


value of X3. Using the formula (7) for combined mean, i.e. 
> NiX + Np Xo 
X25- NN 
1+N2 
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We have, 38 _ (60 x40) + 40 X3 


100 
or 3800 — 2400 =40 X> 
or X> = 1400 =735 

40 


So the mean wage of 40 workers in the evening shift is Ẹ 35. 


Example 18: Find the combined mean from the following data: 


Solution: Here, we are given data related to three groups which can be symbolically put 


as 


N; =200, Ny =250, Ng =300, X =25, X, =20, X; =15 
For combined mean, we put these values in formula (8) and get 


Fe _ NX, + No Xo + NX; _ 200 x25 +250 x20 + 300 x15 
123 Ni tN;+Ny 200 + 250 + 300 


_ 5000 + 5000 +4500 _ 14500 
E 750 ~ 750 


=19.33 


Example 19: The arithmetic average of a group of 50 items is 12 and that of another group of 40 
items is 14. Find the arithmetic average of all the 90 items. 


Solution: The required mean = Ane where fi, fa be frequencies and Xi, X> be 
1t Jo 


respective arithmetic average. 
_ 50x12 +40 x14 _ 600 + 560 


50 +40 © 90 
-1160 _116 i9 9 
90 9 


Example 20: The weight of 150 students in a certain class is 60 kilograms. The mean weight of 
boys in the class is 70 kilograms and that of the girls is 55 kilograms find the number of boys and 
girls in the class. [B.C.A. (Meerut) 2004, 2003] 


Solution: The number of students N =150 
The mean weight of boys = X} =70 
The mean weight of girls = X> =55 
Let number of boys = N} 


ao 
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Let number of girls = No 


Ni + No =N > N, + No =150 wath) 
Also X19 = 60 
We know that _ _ 
z, Mx +N% 
60 = 70 N; +55 No 
150 
> 70 Nj +55 No =9000 
> 14 N; +11N =1800 a2) 
From (1) 14 N; +14 Nə =2100 (3) 
Subtracting = 3N =300 
> Nə =100 
and Nı =150 - No =150 -100 =50 
The number of girls Ns =150 - N; 
=150 -50 
=100. 


Example 21: A person makes a trip which entire travelling 900 kms by train at an average speed 
of 60 kph, 300 kms by bus at an average speed 25 kph, 400 kms by plane at 350 kph and finally 
15 kms by taxi at 25 kph. What is the average speed of the entire distance. 

[B.C.A. (Meerut) 2006, 2008] 


Solution: 


Journey by | Distance (km) | Average Speed (kph) | Time Taken (Hrs) 


Bus 


Average Speed = distance = 1006 = 1615 x35 
time 35 1006 
_ 56525 


~ 1006 
=56.19 km/h 
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And the time can be obtain by 
. distance 
Time = ——— 
average speed 


Example 22: A person travels n equal distances with velocities v,, vo ... Vy. Show that his average 


(M +n +... Py) 


velocity V, V9, V3,... , Vy - Show that his average velocity cannot exceed . When it 


n 


will be equal to this value ? [B.C.A. (Meerut) 2002] 


Solution: Total distance = n equal distances, velocity per distance 1, v9, V, 


Total distance distance 
Average Speed =————_———,, time = ———— 
Total time speed 


ae : speed n +v +... +V 
per unit distance time = PL = een 
distance n 


Y +v +... +Y 


This speed is maximum = ” < Total time 


n 


gat tet On 


Thus, maximum average spee 
n 


Example 23: In a class of 50 students, 10 have failed and their average of marks is 2.5. The total 
of marks secured by the entire class was 285. Find the average marks of those who have passed. 


Solution: Since the average marks of 10 failed students is 2.5, therefore, the total marks 
obtained by 10 students who failed =10 x2.5 =25 
Now, the marks obtained by all the 50 students=285 

the marks obtained by N =40 students who passed = XX =285 -25 =260 


x 
the average of marks of these 40 students = N =—— =6. 


Example 24: In the examination, the average grade of all students in class A is 68.4 and that for 
students in class B is 71.2. If the average of both the class combined is 70, find the number of 
students in class A to the number in class B. 


Solution: Let the number of students in class A and B be N} and No respectively. We 


are also given Xj) =70, X; =68.4, X9 =71.2. Putting these values in the combined 
average formula 


> NX, +N% 
“2 Ne 
[EN 
68.4 N; +71.2 N 
or iis A 2 
N, + No 
or 70(N, + Ny) =68.4 N; +71.2 No 


Dividing both sides by N3, one gets 


70 N 4i -68.4 ÒL +71.2 
No No 
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Ni (70 - 68.4) =71.2 -70 
2 
or 1.6 N =1,2 
No 
Thus, the required ratio, i.e. Ni = = = es 
No 16 4 


Hence, N} and N3 are in the ratio 3 : 4. 


Correcting the Arithmetic Mean 


For correcting the incorrect value of mean, first we find the corrected XX or XfX (in case of discrete or 
continuous series). For this we subtract the wrong items from the incorrect XX (or XfX) and add it to 
the correct items. Finally, on dividing the corrected XX by the number of observations, we get the 
corrected mean. 


Example 25: The average marks of 80 students were found to be 40. Later, it was discovered that 
a score of 54 was misread as 84. Find the corrected mean of the 80 students. 


Solution: We are given N =80, X =40 


Since Xx = 
N 


EX = NX =80 x40 = 3200 
But due to the error discovered, LX =3200 is not correct. 
The correct XX = incorrect XX — misread observation + correct observation. 


=3200 -84 + 54 =3170. 


Corrected £ X _ 3170 
a = 


The corrected average X= =39.625 


Example 26: Mean of 100 items is found to be 30. If at the time of calculation, two items are 
wrongly taken as 32 and 12 instead of 23 and 11, find the correct mean. 


Solution: Given that N =100, X =30 


EX = NX =100 x30 =3000 (Incorrect total of 100 items) 
Corrected £X = Incorrect XX — wrong observations + correct observations 


= 3000 — (32 + 12) + (23 +11) =29.90 


Corrected mean = Com PUEA = 2220 =29.90 
N 100 
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Example 27: The arithmetic mean of 50 students was given 44. But later on it was found that 
marks of a student which were read wrongly as ‘54’, there were actually 34. Now calculate the 
correct mean. 
[B.C.A. (Meerut) 2004] 
Solution: 
The number of students N =50 
Incorrect mean X =44 
Incorrect marks =44 
Correct marks =34 
Incorrect XX 
= 
Incorrect LX =44 x50 =2200 
Correct EX = 2200 -54 + 34 =2180 
2180 


Correct mean (X) = ~— =43.6 
50 


Incorrect X = 


Example 28: The average monthly wages of all the workers in a factory is X 444. If the average 
wages paid to male and female workers are X 480 and X 360 respectively, find the percentage of male 


and female workers employed by the factory. [B.C.A. (Meerut) 2008, 2006] 
Solution: Let total workers = x, then 444 = =< => Èr =444 x 
x 
Again 480 = al 
x 
> Ery =480x, and Xr =360 x9 
Xr = Ey + Ew 
where xı = number of males, xy = number of females 
> 444x =480 1 + 360 (x - 4) 
=> 7x =10% 
7, 
Male percentage = +L ¥100 = i asl x 100 = 70% 
X 10 x“ 


Female percentage = 30% . 


Example 29: Average rainfall of a city from Monday to Saturday is 0.3 inch. Due to heavy 
rainfall on Sunday, the average rainfall increased to 0.5 inches. What was the rainfall on Sunday ? 


Solution: Given that the average rainfall from Monday to Saturday is 0.3 inch. 


Thus, the total rainfall for six days = NX =6 x0.3 =1.8 inches 
Also given that the average rainfall for all the seven days is 0.5 inch. 
n Total rainfall for seven days = 7 x0.5 =3.5 inches 


The rainfall on Sunday =3.5 -1.8 =1.7 inches 


€, Elements of Statistics 


Merits of Arithmetic Mean 


The merits and demerits of arithmetic mean can be discussed in the light of 
characteristics listed for a good average. 


i) Itis simple to understand and easy to calculate. 


( 
(ii) Itis rigidly defined and so its value is always definite. 
(iii) Its computation is based on all the observations. 

( 


iv) Itis also capable of being handled algebraically, i.e. its further algebraic treatment 
is possible. 


(v) Its value is least affected by fluctuations of sampling. In other words, we can say 
that the arithmetic mean has the minimum sampling variability which is better for 
drawing inferences about the population. 


Demerits of Arithmetic Mean 
(i) The value of arithmetic mean is highly affected by extreme values of the variable. 


(ii) | Arithmetic mean cannot be calculated if even a single observation in the series is 
missing. Also, in a distribution with open-end classes the mean cannot be 
determined without making certain assumptions regarding the class-intervals of 
the open-end classes. 


(iii) Arithmetic mean can be a value that does not exist in the series. 
4.2.1.2 Weighted Arithmetic Mean 


Calculation of Weighted Mean 


The formula for calculating weighted arithmetic mean is as follows: 


= _EWX 
= Sar ...(10) 
Here, X, = the weighted arithmetic mean 


W = the weights attached to values of the variable 


X = the values of the variable 


Following few examples will further clarify the uses of weighted mean. 


Example 30: Find the weighted price mean by weighting each price by the quantity consumed for 
the data given in the table: 


Quantity Consumed Price (%) (per kg.) 


eee 
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Solution: 


Computation of Weighted Mean 


The weighted mean price will be 


z, -ZWX _3665 26 94 
EW 590 


l. Harmonic Mean: Let x, x9 ,... , X„ be nvalues of a variate x. The harmonic mean is 


the reciprocal of the arithmetic mean of the reciprocals of the given values i.e. 


l l l Į 
—+— +.. +— E 
Xx) X2 Xn Xi 
If fis fo...-, fn are the frequencies of x, X2 ,... , X respectively, then 


efit tet fn N 
a. eae 


$ 
4 V A Xi 
If w, W ,...,W, are the weight of the values x1, x9 ,... , Xy respectively, then 
_MEmM t+...+ Wy, N 
Mygr ga 
4 V Xs Xi 
Merits 
(i) Itis base on all the observations of the series. 
(ii) It is capable of the mathematical treatment. 
(iii) It is not very much affected by the fluctuations value. 
(iv) It is rigidly defined. 


Demerits 
(i) Itis difficult to calculate. 
(ii) It is not readily understood. 


(iii) It gives higher weight to the smaller values. 
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Geometric Mean (G.M.): The geometric mean of ‘”’ numbers is defined as the 
“nth root of the product of ‘”’ numbers”. It is found out by multiplying all the 
values of a series and extracting nth root of the product. 


Symbolically G.M. = n 4J (x1 . X2 . X3... Xn) 


where, n = number of items. 
41,49 ,...,X, be n, non zero observations. 
log x + log x +... + log x 
or G.M. = Anti log of t a ee 
N 
2 log x 
N 


Uses of Geometric Mean 


(i) 


(ii) 


Geometric mean is highly useful in averaging ratios percentages and rate of 
increase between two periods. 


Geometric mean is important in the construction of index numbers. 


Merits of Geometric Mean 


(i) 
(ii) 
(iii) 
( 


iv) 


It is based on all observations. 
It is rigidly defined. 
It is less affected by the extreme values. 


It is useful in studying economic and social data. 


Demerits of Geometric Mean 


(i) 
(ii) 
(iii) 


It is difficult to understand. 
Non-mathematical persons cannot do calculations. 


It has restricted application. 


Method of Calculation of Geometric Mean 


(i) 
(ii) 
(iii) 
(iv) 
(v) 


Find out logarithm of each value of the size of item from the log table = log x 


Add all these values of log x = È log x 


The sum of log (È log x) is divided by the number of terms = Ele 


Find out antilog of this, we get geometric mean of the data. 


If x1, x9 ,...,.X, be number of variables with frequencies fi, fo,..., fa then G.M. 
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this formula is used for continuous series. 


(vi) IfG,and G be geometric means of two distributions in which the number of times 
are m and n respectively then geometric mean G of the combined distribution is 
given by 


m log G; +n log G 
brge. g G g G2) 


(m+n) 


Example 31: The daily wages of there persons are X 6, 9 and 13.5. Find the geometric mean of 
these wages. 


Solution: If G be the required geometric mean, than 


l 
G=(6x9x13.5)3 


1 1 


-(6 xox) =(27 x27)3 


1 
=(3x3x3x3 x3 x3)3 


=3x3=%9 


Example 32: Calculate the geometric mean of the following 2574, 475,75, 5, 0.8, 0.08, 0.005, 
0.0009. 


Solution: Here, we have 


[ Total = log x =2 .1220 
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Now, if G be the required geometric mean, than 
1 N 
logG= N 2 log x; 
j= 


== [2.1220] =0.2652 


G = Antilog (0.2652) =1.841 


Example 33: Find the geometric mean for the following data: 


Classes 100-200 | 200-300 | 300-400 | 400-500 | 500-600 


a acl 
(f) 


100-200 


300-400 76.3230 
400-500 53.0640 


= f logio x =251.7775 


200-300 P 43.1622 


(x) 
150 
250 
350 
450 
550 


Using the formula for geometric mean 
1 
logio G= 9 $, f logio # 
zal (251.7775) =2.5178 
100 


G = Antilog (2.5178) =329.5 
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Example 34: Calculate the geometric mean from the following table: 


Solution: Here, we have 


If G be the required geometric mean, then 
log G -> Xf (log x) 


“= [27.6725] 


log G =1.1069 
=> G = Antilog (1.1069) =12.79 


Similar Problem: Find the G.M. for the following frequency distribution: 


[B.C.A. (Meerut), 2009] 


Example 35: Find the harmonic mean of 12, 8, 6 and 24. 


Solution: Let H be the required harmonic mean. 


Than 
1 afl 1 1 1] 
— =|= ++ + 
H 4|12 8 6 24 
=> (0. 0833 + 0.1250 +0.1667 + 0.0417] 
=> (0. 4167] =0.1042 
or l NOES 9.596 


-0.1042 1042 
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Example 36: Find the harmonic mean from the following table of marks obtained in class test: 


If H be the required harmonic mean, we have 


=f 25 250000 


H= =12.746 


A “1.9614 19614 


Xx 


Example 37: A variate takes values a, ar, ar? aaar" -l each with frequency one. Show that the 


ame 
n(l-r) 
ot) ok et 
The G.M.=ar 2 andthe H.M.= a 
ZF 
i _ (2 
Prove that AH=G Sia 
2 „2 nl 
Solution: (i) A= at+ar+ar +...+ a 


Measure of Central Tendency 


1 
=[a" yl +2 +3 +...4(-Iny 


and H= ioro o n 


1 1- l 1 
= Pg er et 


an an (1—r) r”! 


l l-r” 
aa 


n nal (n-1) 
_4a(l-7 ) an(l-r)t = ey") olay 2 -G2 
n(l-r) (l—r”) 


AH =G? 


Example 38: Find the average rate of (a) motion in the case of a person who rides the first mile at 
10 m.p.h., the next mile at 8 m.p.h. and the third mile at 6 m.p.h.; (b) Increase in population which 


in the first decade had increase 20%, in the next 25% and in the third 44%. 
[B.C.A. (Delhi) 2007, 2 


Solution: 


(a) In that case, the harmonic mean being the suitable average, the average rate of 


motion 
Total distance _ 3 _3x120 _ 360 


~ Total time taking ‘ae 47 47 
10 


=7 


on tae! 


1 
++ 
8 


(b) Here G.M. is the appropriate average. 
l 
G =(20 x25 x44)3 


log G = L [log 20 + log 25 + log 44] 


== [1.3010 + 1.3979 + 1.6435] 


= 5 x 4.3424 =1.4475 = log 28.02 


> G =28.02 = The average increase in population. 
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004, 2003] 


.66 m. p. p. 


O 
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Example 39: A motor when travelling from rest travels the first twentieth of a mile at 6 miles per 
hour and the next three twentieth of the mile at 8, 12, 24 m.p.h respectively. But its average speed 
over the first one-fifth of a mile is not 12.5 m.p.h. but 9.6 m.p.h. Explain the apparent paradox. 


[B.C.A. (Rohilkhand) 2004] 


Solution: Let A and H are the arithmetic and harmonic means of speeds 


Ano t8tl2 +24 50 _ 


12.5 
4 4 


4x24 
Ge 4 x24 48 


T I I % 5 


Lp. 
6 8 2 24 


It is obvious from the data that the average speed would not be the arithmetic mean, but 


it is the harmonic mean, because 


Total distance travelled 


The average speed = ————— 
Total time of journey 


ee: 
1 1 l l 
— + — + — + — 
120 160 240 480 


-80 oN 2g 6=H.M. 
5x10. 5 


Example 40: A man motors from A to B. A large part of the distance up-hill and he gets a mileage 
of only 10 miles per gallon of gasoline. On the return trip he makes 15 miles per gallon. Find the 
harmonic mean of this mileage. Verify that this is the proper average to be used here, assuming that 


the distance from A to B is 60 miles. 


Solution: Here, the Harmonic mean, 


2 _ 2x30 


=12 mileage per gallon. 


The total distance covered =120 miles (both ways journey) 


The average consumption = — =12 gallons per mile = H 


Hence, harmonic mean is the proper average here. 


[B.C.A. (Kanpur) 2007] 
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Example 41: If the price of a commodity doubles in period of 4 years from 1997 to 2001. What is 
the average percentage increase per year. 


Solution: Here a =2 andn=4 


x =(2)4 , log x -1 log 2 = — (0.3010) =0 07525 


1 
4 
x =anti — log (0.07525) =1.190 
r =1.190 -1=0.190 


Average percentage increase =0.190 x100 =19 


Example 42: The geometric mean of a numbers is x and that of b numbers it is y, obtain the G.M. 


of those a + b numbers combined. 


1 
Solution: (Product of a numbers) 4 =x 
2 
and (Product of b numbers)? = y 
> Product of a numbers = x“ 
and Product of b numbers = y 


The product of (a + b) numbers combined = x“ y? 
l 
G.M. of (a + b) numbers = (xf y)" +b 


Example 43: Find the geometric mean of the series 1, 2, 4, 8, 16, ... ,2” 
1 
Solution: G= [2° gl 92 93 gmt) 
1 
(2! #2 ES Pi. Ayal) 


1 


n(n+l1) (+1) n 
=|(2) 2 =22 


Example 44: The rise in prices of a certain commodity was 5% in 1994, 8% in 1995 and 77% 
in 1996. It is said that the average price rise between 1994 was 26% and net 30%. Justify this 
statement and show that how you would explain it before a layman. 


Solution: The prices in 1994, 1995, 1996 were 105, 108 and 177 respectively, if it was 
taken as 100 for 1993. Thus, the average of prices for these years would be given by the 
geometric mean of 105, 108 and 177. Let G be the geometric mean. 
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1 


Then, G =(105 x108 x177)3 
log G = 5 [log (105) + log (108) + log (177)] 
= > [2.0212 + 2.0334 + 2.2480] 
= 5 X6.3026 =2.100 g 


G =antilog (2.100 g) =125.9 =126 
Thus the average price rise = (126 — 100) =26% 


If the arithmetic mean n be taken as the average of prices. 


Then M == [105 + 108 + 177] =130, 


Giving the rise = 30% . But if we assume this rise of 30% in every year, the price in 1996 


< 130 130 | 130 5) 


— X —— X —— of 
100 100 100 


=100 


Hence the average price rise between 1994 — 1996 is not 30%. 


Example 45: If there are r series of observations N), No ..., N,, the mean mof the whole series is 
related to mean My, Mo ,..., of the component series by the equation. 
n 
NM =N; M; + No My +... + N,M, where N = > N, 
F= 


[B.C.A. (Delhi) 2008; B.C.A. (Kanpur) 2006; 
B.C.A. (Rohilkhand) 2002] 


Solution: If Mj where j =2,3,4,...,.N, denote the values of the variables in the first 


series, then the arithmetic mean of this series is given by 


or NM, = Xij sa(1) 


Similarly if X9 jp j =1,2,3,...,. N denote the values of the variables in the second series, 
then the arithmetic mean of this series is given by 
No 


N M3 =X X2; (2) 
jal 
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Similarly, 


Adding (1), (2), (3) we get 
Ny N N, 


Ni M, + Nọ Mo + N3 M3 +..+N,M, =>, ity) xyjt ty Xyj 
j=l j=l j=l 


= Sum of the values of the variables in the series 
=NM 
Where, M = The mean of the whole series 


Theorem 1: The algebraic sum of the deviations of all the variate values from their 
arithmetic mean is zero. 


Proof: Let 11, x9 .... , Xp be the values of the variables x with corresponding frequency 


fio fa vee fa Then 
pe LG E, (1) 


Deviation from ¥ are given by d = x — M, then 
Lfd =Uf(x-—xX)=Ufx -—XUf =xUf -xUf =0 


Theorem 2: If every value of the variable is increased by some constant a then arithmetic 
mean is also increase by a. 


Proof: Let 1, 1 ,...,%, be the values of the variable x with corresponding frequencies 


Kio fa veo fe Then wt 
sarga pe es Ae 
À POE af S 3 If 


ll 
=I 
t 


4.2.2 Median 


In a given ‘array’ i.e. when observations are arranged in an ascending or descending order, 
every observation holds a certain rank, be that first, second, tenth or forty-second. One 
such point which divides the array into two equal parts, so that exactly one-half of the observations 
are below, and one-half are above the point, is called median. Since the median denotes the 
central position of an observation in an array, it is called a position average. According 
to Connor, “The median is that value of the variable which divides the group into two equal parts, 
one part comprising all values greater and the other values less than the median”. 
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Determination of Median in Individual Series 


The determination of median in an individual series involves the following steps: 


(i) The data are arranged either in ascending or descending order. 


th 
(ii) | Locate median by the value of AS) item in the order. N being the number of 
observations in the series. 


Example 46: The following data relate to the height of nine students in a class. Compute the 
median height: 


o ec ee EC 


mornan [foe [or fm e e [oe [oo [oe 


Solution: For computing median height, we first arrange the data in an ascending order: 


142, 144, 146, 147, 149, 150, 151, 153, 154 


N+ = ae = 5th item in the 


As a second step, we determine by finding the value of 
above order. 


Median=Size or value of 5th item =149 cm. 


Example 47: Calculate median from the following data: 


me Pepe Pp PPP e 


a dd e ee 
Determine the median. 


Solution: First we arrange the given marks in ascending order: 


2 


N 1)" (2H 


th 
The median is the value ot( a =5.5 th item. 


value of 5th item + value of 6th item 
2 


Median = value of 5.5th item = 


a= == =26.5 marks. 
2 2 


i 
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Example 48: The following table represents the marks obtained by a batch of 20 students in 
certain class tests in Statistics and T 


[Serial No. | 

Mark 

m sl salala |) aon eee læ lal ae 
(Statistics) 

Mark 

i sl s l æl a l a leaa l A 
(Accoun oat ) 


[Serial No. e| 

Marks æ || æa | s læa lal a l Ses res | Se 
(Statistics) 

arie wa a | ae | o ea | a | Se ol se 
(Accountancy) 


Indicate in which subject is the level of knowledge higher? Give reasons. 


Solution: The use of median is recommended when exceptionally large or small values 
occur in the series. It is also used when one is interested in comparing the average 
performance of students in different subjects, colleges etc. In view of this median is the 
most suitable measure of central tendency in the present example. For determining 
median marks in the two subjects, the marks are first arranged in an ascending order as 
follows: 


Computation of Median 


Marks in Marks in S. No. Marks in Marks in 
a ET aa TUENA 


Here, we have 20 observations in both the series. So the median will be the value of 


th th 
(=#) -(2#) =10.5th item 


2 
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value of 10th item + value of 11th item 
2 


Median = value of 10.5th item = 


45 +46 


Median marks (Statistics) = =45.5 


39 + 42 -40.5 


Similarly, Median marks (Accountancy) = 


The comparison of median marks in the two subjects shows that the level of knowledge is 
higher in Statistics. 


Determination of Median in a Discrete Frequency Distribution 


In case of a discrete frequency distribution, the procedure of determining median is 
fundamentally the same as that for an individual series. The procedure consists of the 
following steps: 


(i) Arrange the data in ascending or descending order or magnitude. 


(ii) Obtain the cumulative frequencies. 
th 
(iii) Determine the size of (=#) item. N being the total frequency. 


(iv) Median is located at the value of the variable in whose cumulative frequency the 


th 
value of (=#) item falls. 


Example 49: Find the median size of the following series: 


sw fe e e o e e 


CM «|e |so|a]ajļu]> | 


Solution: As a first step, compute ‘less than’ cumulative frequencies (C.F.) as shown in 
the table. Then, for locating median, we find the value of 


Computation of Median in Discrete Series 
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Nel th 100 +1 th 
í + ) -( i ) =50.5th 
2 2 


item. Then, as a last step, median is located at the value of the item in whose cumulative 
frequency the value of 50.5 th item falls. 


Median = 7 
Determination of Median in a Continuous Frequency Distribution 
In the case of a continuous frequency distribution, we first locate the median class by 


th 
cumulating the frequencies until (=) point is reached. Finally, the median is 


determined within this class by using an interpolation formula. The procedure thus 


involves the following steps: 


(i) | Compute cumulative frequencies. 


N th th 
ii Find the size of | — item. See that is not used in this case. 
(ii) 9 


(iii) Locate the median class in cumulative frequency column where the size of 


th 
(=) item falls. 


(iv) Obtain the median value by applying the formula: 


D 
Median = 4 + 2— (h — 4) (11) 
f 
Here, | = lower limit of the median class. 


h = upper limit of the median class. 
f = frequency of the median class. 


c = cumulative frequency of the class preceding the median class. 


Example 50: From the following table find the value of median: 


11-15 | 16-20 | 21-25 | 26-30 | 31-35 | 36-40 | 41-45 | 46-50 
IU 


paeo tet» [e ee e fe 


[B.C.A. (Meerut) 2007] 


Solution: Here inclusive classes are given. Therefore, for median determination, 
inclusive class limits need to be converted into class boundaries as shown in the table on 
the next page. 
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Calculation of Median (Continuous Frequency Distribution) 


Class Boundaries | Frequency (f) | Cumulative Frequency 
(f) 


Cao e a 
Ca se |e fe 
ae es e [= 
Pam e [ef * 


Pos [ens [os Pe 
Pace [ces [oe fe 
Tae feces [ow fm 
Tce [oes [ef 
ee 


Median is the value of - = — =65th item which lies in the class (30.5 -35 .5). Thus, 


(30.5 -35.5) is the median class. For determining the median in this class, we use 
interpolation formula (11) as under: 


|z 


= 
Median =}, + 2— (h -4) 
f 
65 -56 
35 


=30.5+ (35.5 -30.5) 


9x5 


=30.5 + =30.5 +1.29 =31.79 


Example 51: Calculate the median for the following: 


mm |o |e o o e e ee 


wasa |o [e fo Lo e e [oo 


eo 
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Solution: In such examples, we first convert the given cumulative frequency 


distribution into the simple frequency distribution as shown in the table on the next 
page. 


Now, median is the value of A eel =125th item which lies in (50 —60). Thus, 


(50 — 60) is the median class. Using formula (11) 


AN ai 
Median = 4 + 2— (h -4) 
2564 Ay 

31 

290 


=50 + —— =50 +9.35 
31 
=59.35 


Example 52: Following is the distribution of marks of 50 students in a class: 


Calculation the median. 


Solution: In this example, we are given ‘more than’ cumulative frequencies. To get the 


value of median, the given distribution is changed to a simple frequency distribution as 
shown in the table: 
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Median is the value of ` = Z =25th item which lies in (20 — 30) class. For calculating 


median, we use formula (11) ie. 


4 


50 -46 =4 


46 -40 =6 10 


40 -20 =20 30 


20 -10=10 40 


10-3 =7 


N 
Median = J, + 2 


(b =h) 


where, h =20, h =30, Č =25, f =20,¢=10 


Median = 20 + coum 


(30 - 20) 


=20+ DO 259 +7.5 =27.5 
20 


Example 53: Determine median for the following income distribution: 


below 100 | 100—200 | 200—300 | 300—400 | 400—500 | above 500 
Groups 


No. 
Persons 


Solution: Open-end distribution do not present any difficulty in the location of median 


as will be clear from the following: 


Since median is the value of = — =50th item which lies in the group (300 — 400). 


Thus 300 -400 is the median class. Finally, the median is determined by using the 


interpolation formula. 
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Determination of Median 


=300 + 56.67 == 356.67 


Merits of Median 


(i) Itis easy to calculate and understand. In some cases it can be located simply by 
inspection. 


(ii) It is a truly defined average as it is the central position of the given data. 


(iii) Median can be defined for qualitative data and it is possible to rank the 
observation in some order. 


(iv) Itis not affected by extreme observations. 
(v) The value of median can also be located graphically. 


(vi) Median is specially useful in the case of open-end distributions. 


Demerits of Median 


(i) For locating median, data have to be arranged in ascending or descending order 
which is quite tedious for a long series of observations. 


ii) Its determination is not based on all the observations. 
iii) It is not capable of further algebraic treatment. 
iv) Its value is affected by fluctuations of sampling. 


v) In a continuous series, median is calculated by using an intropolation formula. 
This formula is based on the assumption that all the frequencies of the class are 
uniformly distributed over the interval which is not always true. 


Gao) Elements of Statistics 


4.2.3 Mode 


It is defined as the value of the variate having the maximum frequency. 


According to Kenney and Keeping. “The value of the variable which occurs most 
frequently in a distribution is called the mode”. 


The certain cases the maximum frequency may exist in more than one class. In such case 
there would be more than one mode and the distribution would be bi-modal or multi 
modal. This generally happens when two a more set of observation are combined in one. 
For example, wages of skilled and unskilled workers of a factory. 


Uses of Mode 
(i) Itis used when a quick and approximate measure of control tendency is required. 


(ii) When the measure of central tendency of the series is one which is most frequently 
occurring. 


Determination of Mode in Individual Series 


For an individual series the mode can be located simply by inspection. Here the data can 
be arranged in an array and then count the frequencies of each variate. The variate having 
maximum frequency is the mode. Thus, in fact, we always need grouping for mode, since 
without grouping there would be no frequencies. The following example will clarify the 
point. 


Example 54: Find the mode of the following array of an individual series of scores: 
9,9, 10, 11, 11, 11, 12, 14, 14, 17 


Solution: Here there are only ten observations in the array, we need not put the data in 
the form of a frequency distribution and mode can be determined by inspection only. In 
this case, the observation 11 has the maximum frequency 3. Therefore 11 is the mode. 


Mode in Discrete Distribution 


In the case of a grouped frequency distribution mode can be located simply by 
inspection. Here the variate having maximum frequency will be taken as mode. 


Example 55: Determine mode from the following distribution: 


Solution: In the above discrete frequency distribution the variate value 18 has the 
maximum frequency 21. Therefore, 18 is the mode of the given discrete distribution. 


f 
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Mode by Grouping Method 


In a uni-modal distribution where the distribution has only one mode and the highest 
concentration of values is around one value only, we do not face any difficulty in locating 
the modal value. However, the difficulty is experienced when nearly equal concentration 
of the frequencies is observed on two or more neighboring values. In such cases attempt is 
made to find the value of concentration with the help of grouping method. Jn grouping 
method the values are first arranged in ascending order and their frequencies are noted. Normally, 


the grouping table has the following six columns. 
Column 1 : The maximum frequency is marked by putting a mark or a circle. 
Column 2 : The frequencies are grouped in twos and the highest total is marked. 


Column 3 : Leaving the first frequency, the remaining frequencies are grouped in twos 


and the highest total is marked. 
Column 4 : The frequencies are grouped in threes. The highest total is marked. 


Column 5 : Leaving the first frequency, the remaining frequencies are grouped in threes 


and the highest total is marked. 


Column 6 : Leaving the first two frequencies, the remaining are grouped in threes and 


the highest total is marked. 


After completing the grouping table, an analysis table is formed for finding the value or 
the observation which is repeated the highest number of times. The same procedure for 
determining the modal class in the case of a continuous frequency distribution. Let us 


consider an example for clarifying the procedure. 


Example 56: The following table gives the measurements of collar sizes of 230 students in a 


university. Determine the modal size of the collar: 


Solution: Here we observe nearly equal concentration of frequencies for the collar-sizes 
36, 37 and 40. Thus, grouping method is used in locating the modal class. Following the 
steps in grouping method listed above we get the following table on the next page. 
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Determining Mode by Grouping Method 


Frequency 
Collar Size 
2P. 7 


Sue 


After forming the grouping table, values having highest frequency (shown in bold letters) 
are counted with the help of an analysis table as shown below: 


Analysis Table 


From analysis table we observe that size 36 is repeated the highest number of times. 


Therefore, Modal size=36 cms. 
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Example 57: Determine the modal value in the following series: 


Solution: In this case, equal concentration of values is observed at values 16 and 28. So 
we determine modal value by grouping method. 


Determining Mode by Grouping Method 


Frequency 


Values Contributing to the Highest Frequency 


Column No. 


Ww 
S 
ee) 
N 


Among the highest frequencies in different columns, the value 18 repeats most 
frequently. Thus, modal value =18. 


~~m 
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Determination of Mode in a Continuous Series 


In a continuous frequency distribution, frequencies are given in various classes. Now a 
class having maximum frequency is called the modal class. In case nearly equal 
concentration of frequencies is observed in two or more classes, the grouping method can 
be used to determine the modal class. After determining the modal class, the precise 
value of the mode is obtained by using the following interpolation formula. 


Modes i A D oeh ...(15) 
2fi-P-Rh 
where, h = the lower limit of the modal class. 


fi =the frequency of the modal class. 
fo = the frequency of the modal class preceding modal class. 
fa = the frequency of the class succeeding modal class. 


h = size of the modal class. 


Example 58: Obtain mode of the following distribution: 


10—20 20—30 30—40 40—50 50—60 60—70 


rer fs e e e a 


[B.C.A. (Meerut)2007] 


Solution: In this example, the highest frequency 45 lies in the class (40 —50). Thus, 
(40 - 50) is the modal class. For determining the mode in modal class, we use 


interpolation formula (15). 


Medex} 41-0 _yy, 
2fi- fo-f 


Here, fo =25, fi =45, fo =11,4 =40 andh=10. 
45 -25 
Mode = 40 + ——————- x 10 = 40 + 3.70 =43.70. 
90 -25 -11 


Example 59: Locate mode from the following data: 


mau |o |o [ol ale [ela 


Solution: In this case, see the nature of the given data. It is not a discrete frequency 


distribution. Instead, the mid-values of the classes with corresponding frequencies are 
given. Thus, class-intervals are first determined by using the given mid-values. In this 
example, maximum concentration of frequencies is at two places, as such, the modal class 
is determined by grouping method. 
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Determination of Modal Class by Grouping Method 


Frequency 


Classes 
= i 


ho |p 
he bs 

ko i r 
| ls so 


Analysis Table 


Classes Contributing to Highest Frequency 
Column No. 
25-39 |33- 45 145 59 |5365 || 65 73 | 73-85 1185795 


The analysis table shows that (55 — 65) is the modal class. So by using the interpolation 
formula (15), ie. 


Medea A h 
2fi-fo-~fh 


Here, fo =17 
fi = 
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fo =31 
h =55 
h=10 
29 -17 
Mode =55 E iss +12 =67.0 
58-17-31 


But 67.0 does not lie in the modal class (55 — 65) and therefore our answer is not correct. 
To overcome this difficulty we use interpolation formula (16), i.e. 


Madés§ = — Aol ga 
lA-fol*lA- fl 
=55 ge PN ote tes g 110 
|29 -17| +|29 -31| 12 +2 
=55 +12 
=67 


Example 60: Find the mode for the following distribution: 


10—20 | 20—30 | 30—40 | 40—50 | 50—60 | 60-70 


oor fs | o [e e e e [2 


Solution: In this example, also, the maximum concentration of frequencies is at three 
places, as such, the modal class will be determined by grouping method. 


Determination of Modal Class by Grouping Method 


Frequency 


Classes 


bss 
52. 


hs 

| 
k2 
TEE 


b2 
32 


j 
| 


ho 
k 


5 
10 
20 
22 
24 

6 

2 


1 ` 
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Analysis Table 


Classes Contributing to the Highest Frequency 


Column No. 
10—20 | 20—30 | 30—40 | 40—50 | 50—60 | 60—70 


v 


The analysis Table shows that (30 — 40) is the modal class. So by using the interpolation 


formula (15),we have 


Mode = } E E” 


2fi- P-R 
Here, 4 =30, fo =20, fi =22, fo =24 andh=10 
Mode=30a 2A ad 
44 -20 -24 


=30 + (=| x10 = infinite 


So, in this case also we do not get an admissible value of mode. As such, we apply 
interpolation formula (16) to get the Modal value, i.e. 


Modesta Ol ay 
lA-fltlA- fl 
=30 + [22—20] sigas 2 


Dee =30+5 
[22 -20|+|22 -24| 2 


= 35.0 


Determining Mode when Class Intervals are Unequal 


The formula (15) for locating the mode is applicable where the distribution has equal 
class-intervals. When the class-intervals are not equal, we must make them equal before 
calculating the mode. This is done on the assumption that the frequencies are uniformly 
distributed throughout the class. 
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Example 61: Calculate mode from the following table: 


Age (in years) 0—6 | 6-12 12—18 | 18—22 |22—24| 24—26 | 26—30 | 30—36 | 36—42 


paaa [o|o |s| olee |o ee 


Solution: In this example, the respective sizes of the class-intervals are 6, 6, 6, 4, 2, 2, 4, 
6, 6. However, by properly combining the middle classes of sizes 4, 2, 2 and 4, the series 
can be amended as under: 


The Given Series The Amended Series 


0-6 6 0-6 6 


Hence (18-24) is the modal class in this example. Now the mode for the amended series 


can be obtained as under 


Modes} +—A=f wg 
2h - fo- fr 


Put, h =18, fo =25, f =25, fo =18andh=6 
we find 
Medes aia 
70 -25 -18 27 


=18+2.22 =20.22 years 


Example 62: In a moderately asymmetrical distribution, the mode and mean are 64.2 and 67.4 
respectively. Find the median. 
Solution: The empirical relation between mean, median and mode is : 
Mode =3 Median -2 Mean 
Here, mode = 64.2 and mean = 67.4 
64.2 =3 Median —2 (67.4) 
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or 3 Median = 64.2 + 134.8 = 199.0 
or Median -> =66.33. 


Example 63: For a symmetrical distribution mean is 50.6. Find the values of median and mode. 


Solution: For a symmetrical distribution mean = median = mode 


Median = 50.6 and Mode=50.6. 


Example 64: For a moderately asymmetrical distribution, the median and mean are 30.2 and 
32.4 respectively. Determine mode. 
Solution: For a moderately asymmetrical distribution, 
Mode = 3 Median — 2 Mean 
Putting the given values in the above relationship, one gets 


Mode =3 (30.2) — 2 (32.4) = 90.6 — 64.8 = 25.8. 


Merits and Demerits of Mode 


Merits 
(i) It is comparatively easy to understand. 
(ii) It is the simplest descriptive measure of average. 


(iii) Itis not affected by extreme items. It can be obtained even if the extreme values are 
not given. 


(iv) It can be determined for open-end distributions. 
(v) It can be located graphically while mean cannot be ascertained through graphs. 


(vi) Mode has been defined as the most typical value of a distribution. Therefore, it is a 
useful average for many practical situations, such as, average size of shoe, average price 
of a commodity, the average type of dress, average wages and so on. 


Demerits 
(i) Itis not precisely defined. 
(ii) | It is not based on all the observations. 


(iii) It is not capable of being handled algebraically as its value is not based on all the 
observations. 


(iv) The mode does not exist in many cases while there may be more than one mode in 
other cases, i.e. it is not useful as an average in such situations. 


(v) The value of mode is significantly affected by the size of the class-interval which is 
the basis of grouping the frequencies. 
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Example 65: A distribution x, x9 ,...,X, with frequencies fi, fo ,..., fy is transformed into the 
distribution X,, Xə ,..., X„ with the same corresponding frequencies of the relation 


X; =ax; +b, i=1,2,3,...,n 


where aand bare constants. Show that the mean, mode and median of the new distribution are given 


in terms of those of the first distribution by the same transformation. 
[B.C.A. (Delhi) 2006; B.C.A. (Kanpur) 2004; B.C.A. (Meerut) 2002] 


Solution: Given X;=ax; +b, i=1,2,3,...,n 
(i) Let X and 7 be the means of the X; and x; values respectively. Then 
-> fiX; 2 filth) ad fix 
ee =i +p 
F7 Eg XS 
i i 
=ax +b 


Thus, the arithmetic mean is given by the same transformation. 


It will make to the lower limit of X;, change into alọ + band interval iwill change to 


` 


Thus, new median (M,') = (alọ + b) + 


=a My + b 
ay eki H Gai 
2 f _ f ~ fa 
Lower limit will become ax; + b and interval = a; 


a Mg’ =(alo +) + AE 


= Pit - 
-alio + At Lt | ssam +b 
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Example 66: In what respect A.M. is superior to mode and median as measure of central tendency. 


[B.C.A. (Meerut) 2003, 2002] 


Solution: 


(i) 


(ii) 


A.M. is capable of further mathematical treatment whereas mode and median have 
no such property. 


A combined mean of many series can be obtained, where as evaluation of 
combined mode and median through some formula is impossible. 


Arithmetic mean is least affected by the fluctuations of sampling, whereas mode 
and median are in victim of it. 


Median and mode can have end values of the series but A.M. will have the central value. 
Thus, median and mode may not be true representative of the data. 


A.M. depends upon all the observations, but medium and mode do not. 


A.M. is very well defined. But median and mode both are comparatively ill defined 
measures. 


Example 67: The monthly incomes of 10 families are given by the following table: 


Compute the arithmetic mean, geometric mean and harmonic mean of the above incomes. Which of 


above three means represents the data most suitable? Give reasons. 


Solution: Here arithmetic mean 


xXx; _ 1136 


n 


=7]13.6 


Geometric mean 


l 
G=(85 x70 x15 x75 x500 x20 x45 x250 x40 x36)10 


> log G = = [log 85 + log 70 + log15 + log 75 + log 500 
+ log 20 + log 45 + log 250 + log 40 + log 36] 
= 7 [1.9294 + 1.8451 + 1.17614+18751+ 2.6980 + 1.3010 
+ 1.6532 + 2.3979 + 1.6021 + 1.55631] 
=+ x 18.0342 =1.8034 = log 63.59 
> G=Ẹ7 63.59 
And harmonic mean. 
H- 10 


l l l l l l l l l l 


+ + + 
8&5 70 15 75 500 20 45 250 40 36 
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10 
~ [0.01176 + 0.0143 + 0.0667 + 0.0133 + 0.002 
+0.05 +0.0222 + 0.004 + 0.025 + 0.0278] 


10 


=———. = 42 18 
0.23706 


Clearly the geometric mean represents the data most suitable since out of 10 families, 8 
families have their income less than the arithmetic mean while the harmonic mean 
represents the lower income group only. 


Example 68: The expenditure of 100 families are given by: 


Expenses in R) 10-20 20-30 30-40 40-50 


The mode for the distribution is 24. Find the missing frequencies. 


Solution: Let A and F be the unknown frequencies of class 10-20 and 30-40 


respectively. 


Expenses in (7) No. of Families (f) Commutative Frequency 


Coa oo 
[id Neepscraee 


N =100=56+ F + B 
=> F, + Fy =44 val I) 


Here mode is 24 which clearly lies in the class 20-30 and so 20-30 is the modal class. 
Therefore from the formula, made Mọ is given by 


My <1+ A xh 
2f-fi-Ah 
where, 1=20, f =27, gi) =A, f =B andh=10 
s4=904 -A x10 
54-h - 
> 3A -2 B =27 (2) 


Solving (1) and (2), we have 
Ff =23, h =21 
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Example 69: A man travelled by car for 3 days. He covered 480 kms each day. On the first day 
he draw for 10 hours at 48 kms an hour, on the second day the draw for 12 hours at 40 kms an hour 
and on last day he draw for 15 hours at 32 kms. What was his average speed. 

[B.C.A. (Rohilkhand) 2010, 2008] 


Solution: Since the distance travelled is constant i.e., 480 kms each day, the 
appropriate average is the Harmonic mean. 


N 
H.M. = i i i 
“I X2 X3 
Here N =3, x4 =48, x3 =40, x3 =32 
= Ps gy 
l l l 
48 40 32 


Example 70: The following tables shows certain data collected for the three regions of a country: 


Average Annual Income 


per person (R) 


Obtain the above figures for the three regions taken together. 
[B.C.A. (Rohtak) 2009; B.C.A. (Rohilkhand) 2006] 


Solution: Total population =10 + 5 + 20 =35 million 


Average population = 35/3 =11.6 million 


Total number of literates =10 x 22 +5 ye +20 x a =5.2+3.4+8.2 
100 100 100 


=1.6 millions. 


x 100 =48 % 


. . À l 
Percentage of literacy in overall region = 


ny xy + UD) X9 + ng X3 
Overall average annual income person = ¥]93 =——— 
X + X2 + X3 
7 10 x 8000 + 5 x 5800 + 20 x 7500 _ 259000 
7 10+5+20 35 


= 7400 
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Example 71: Calculate the values of Arithmetic mean, mode and standard deviation for the 
following distribution: 


Wage (R) No. of Workers 


Solution: 


A.M.=X=A+ >se xi, Here A =35, 2 fd’ =97, N =100 


i=10 = length of class interval 


> X= S x10= 35 +9.7 =44.7 
100 


Mean = 44.7 
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By inspection, modal class interval is 40-50 


Mode (Z) 24 4 i _y; 
2fi - fo - fr 
Here, 4 =40, fi =28, fo =12, fp =22,i=10 


g 28-12 Y 
2 x28-12 -22 


> Mode (Z) =40 + 10 


=40 + = x10 =40 + 7.27 =47.27 


Mode (Z) =47.27 


Syd" 4) es 


Standard deviation (6) = N N 


Here = fd’? =415, E fd’ =97, i=10, N =100 


2 
gx (52) ad 
100 100 
= 4.15 -0.94 x1l0= 73.21 x10 


=1.792 x10 =17.92 


Standard deviation =17.92 


Example 72: In what respect A.M. is superior to made and median as a measure of central 


tendency ? 


[B.C.A. (M.D.U) Rohtak 2011; B.C.A. (Agra) 2010, 2007, 2005; 
B.C.A. (Meerut) 2008; B.C.A. (Rohilkhand) 2008] 


Solution: 


(i) A.M. is very well defined. But median and mode both are comparatively ill defined 


measures. 


(ii) A.M. depends upon all the observations, but median and mode do not. 


(iii) A.M. is capable of further mathematical treatment whereas mode and median have 


no such property. 


(iv) Acombined mean of many series can be obtained, whereas evaluation of combined 
mode and median through some formula is impossible. 


(v) Median and mode can have end values of the series but A.M. will have the central 
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value. Thus median and mode may not be true representative of the data. 


(vi) Arithmetic mean is least affected by fluctuations of sampling, whereas mode and 


median are in the victim of it. 
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Example 73: Give an idea of the uses and limitations of averages in summarising a given data. 
[B.C.A. (Kurukshetra) 2011; B.C.A. (Lucknow) 2009, 2005] 


Solution: 


(i) | Averages are used for obtaining variation, skewness, kurtosis and many other basic 


characteristics of a data. 


(ii) Averages help us in considering the mass of the data in one single value which 


represents the entire group. 
+ Limitations of Averages 


(i) | The averages do not tell anything about the nature and composition of the 
data under study. So the averages should not be expected to tell more than 


what they can. 


(ii) An average in itself is single simple value, so it can yield wrong 


interpretations too. 


(iii) At times, average in themselves fail to convey about the nature of the series, 


because two quite different series may have the same average. 


Example 74: Evaluate the values of mean, mode and median for the following grouped cumulative 


data: 


[B.C.A. (Lucknow) 2009, 2006; B.C.A. (Bundelkhand) 2008] 


an 
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Solution: These results can be tabulated as given below: 


No. of Days No. of Deviation from 


Absent 
(class) 


Students Cumulative | Assumed Mean 
Frequency Frequency A (22 5), 
(f) d=x-A 


Mean = de! -292.5 + 24488-3 L99 5 -8.977=13.5 
500 
ARPE 
Median = h + 2 xi 
f 
Here, h =10, c. f.=124, f =225,i=5, N =500 
250 -124 
Median =10 + eh =10 + Ê -10 +2.8=12.8 
225 225 
EE T 0g. a 
2fi-fo- fr 2 x225 -95 -93 
soea 19g 


450 -188 
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Example 75: The mean of the following frequency distribution is 62.8 and the sum of all 
frequencies is 50. Compute the missing frequency fi and fy. 


20-40 | 40-60 | 60-80 | 80-100 100-110 | Total | 


rE i o ea e 
[B.C.A. (Agra) 2009; B.C.A. (Meerut) 2006] 


Solution: We have 


5+fi+l0+ fp +7+8=50 > fy =20-f 


Now, we may prepare the table given below: 


Pow fs |» f o i 
Cow a o | o 
La e a 


w 
Cow | og | o | o 
Cona | o | o f w 


‘x, 3460-40 f; 
Mean Fie SOON eee mean = 62.8 
j 50 
3460 -40 fi 
— g 7628 =3460 -40 fi =3140 
>40 fi =320 or fi =8 
This, f= 
and fa =(20-8)=12 


Remark: We can use the formula for calculating the mode as: 


Mode, M, =x +h. pa 
(2 fi = fea = frst) 
Where, 
x, = lower limit of the modal class interval 
h = width of the class interval 
J, = frequency of the modal class 


fy- = frequency of class preceding the modal class 


Sr) = frequency of class succeeding the modal class. 


lr, 
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Example 76: Calculate the mode for the following frequency distribution: 


10 — 20} 20 — 30 | 30 - 40 | 40 — 50 | 50 — 60 | 60 — 70 | 70 - 80 


Ee |e | |e [ele [ee 


[B.C.A. (Meerut) 2010] 


Solution: As the class 40 — 50 has the maximum frequency, so it is the modal class. 


Xı = ‘N= š p = ” = = an a i 
, =40,h=10, f, =28, f] =12 and f, ,; =20 


Mode, M, =x, + h x sal 
(2 fe = fea = fest) 


=404 410x%—@8=!2) 
(2 x28-12 —20) 


=40+ {10 «net (40422) - 40 + 6.67 =46.67 
24 30 


Example 77: Calculate the mode for the following frequency distribution: 


25-30 | 30-35 | 35-40 40-45 | 45-50 | 50-55 


Solution: The class 35 —- 40 has maximum frequency. So, it is the modal class 
x, =35, fk =50, fk-1 =34, fe) =42 and h = 


(k= Set) | 


Mode, M, = xy + 5h x — 
(2 fk- Se-1 fk+1) 


5 eg e l =35.4{5 x30} 
(2 x50 —34 —42) 4 


- {35 + 2l- (35 +3.33) =38.33 
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What is meant by central tendency ? Describe the various methods of measuring it 
and point out the usefulness of each method. 
[B.C.A. (Bundelkhand) 2010, 2008, 2006; B.C.A. (Rohilkhand) 2009, 2004, 2000; 
B.C.A. (Avadh) 2008; B.C.A. (Garhwal) 2008, 2005] 
Give a critical description of important measures of central tendency and their 
merits and demerits. 
[B.C.A. (Meerut) 2011 2008, 2006; B.C.A. (Kanpur) 2009, 2005, 2003] 


What is statistical average ? What are the desirable properties for an averages to 
possess ? Which of the averages, processes these properties ? 


[B.C.A. (Garhwal) 2011, 2007; B.C.A. (Agra) 2009, 2005] 
What do you mean by central tendency ? What are the desirable for an average ? 
Which average possess most of these properties ? 

[B.C.A. (Kurukshetra) 2011; B.C.A. (Jodhpur) 2009, 2006; 
B.C.A. (Avadh) 2008, 2005; Kanpur 2007 ] 

Write a short note on Geometrical mean and Harmonic mean. 
[B.C.A. (Kanpur) 2008, 2005; B.C.A. (Agra) 2005] 


From the following values locate the mode and median 33, 20, 35, 37, 33, 35, 25, 


33, 94r; [B.C.A. (Meerut) 2005] 


Determine the mode in the following distribution: 


Kaa Graa oen aaao | 10 | 2o | 30 o fao fa 


Noo Fs Pepe pe tas [s 
Calculate mode by the method of grouping 


[easiness 
Ooo booo o S 2 OE 
e | 8 CCidz 


— ww 80 


[B.C.A. (Rohilkhand) 2006] 


ie 
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9. 


10. 


11. 


12. 


13. 


14. 


15. 


Find the median from the following distribution: 


[B.C.A. (Kanpur) 2005] 


Calculate the median and mode from the following series: 


No. of Students 


Marks Less than 
No. of Students 


From the following table calculate the mode and first and third quartiles: 


Income (®) 100-200 100-300 | 100-400 | 100-500 | 100-600 


[B.C.A. (Meerut) 2004] 


[B.C.A. (Rohilkhand) 2008; B.C.A. (Garhwal) 2004, 2006; B.C.A. (Meerut) 2005] 


Five percent of the workers in a firm, employing of 2000 men, earn than Ẹ 2.00 per 
hour, 480 earn from Ẹ 2.00 to 22.24 per hour, 35% from Ẹ 2.25 to Ẹ 2.49 per hour, 
370 earn from Ẹ 2.50 to Ẹ 2.75, 12% from Ẹ 2.75 to Ẹ 2.99 and the rest earn Ẹ 3.00 
or more per hour. What is the median wage ? 

[B.C.A. (Udaipur) 2011, 2007] 


In the following in complete distribution the value of median is 45, find out the 


missing frequencies: 


Class-interval | 10-20 |20-30 |30-40 | 40-50 | 50-60 | 60-70 | 70-80 


[B.C.A. (Rohilkhand) 2005, 2001] 


Calculate arithmetic mean of following marks in Economics obtained by 10 
students in a monthly test: 


ane lp p e 


[B.C.A. (Avadh) 2009, 2005, 2003; B.C.A. (Kanpur) 2006, 2004] 


16. 


17. 


18. 


19. 


20. 


21, 


a 
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Calculate the arithmetic mean and mode from the following table: 


vamen] o> lo] =| el o a o 


anarsa fo fsbo] =pwlols ps 


[B.C.A. (AMU) 2011, 2007; B.C.A. (Ajmer) 2009, 2005] 


(a) If the price of a commodity doubles in 4 years, what will be the average 
percentage ratio of increase per year ? 
[B.C.A. (Meerut) 2006] 
(b) The population of a country which was 30 crores in 1989 increased to 52 
crores in 2007. Find the percentage compound rate of annual increase. 
[B.C.A. (Udaipur) 2006] 


(i) A cyclist pedals from his house to his college at a speed of 10 km. p.h. and 
back from the college to his house at a speed of 15 km. p.h. What is his 
average speed ? 

[B.C.A. (Agra) 2009, 2006] 

(ii) Aman travels from Lucknow to Meerut at a speed of 30 miles per hour and 
returns by the same route at 60 m.p.h. Find the average speed for the whole 
journey. 

[B.C.A (Meerut) 2009, 2003, 2001; B.C.A. (Agra) 2005, 2003] 

(iii) A train goes at a speed of 20 km. per hour for the first 16 kms, at a speed to 
40 km. p.h. for 20 kms. It covers the last 10 kms at speed of 15 km.p.h. Find 
out its average speed. 

[B.C.A (Udaipur) 2011, 2005; B.C.A. (AMU) 2009, 2007; 


B.C.A. (Jabalpur) 2008, 2005] 


Explain the difference between positional (vocational) average and mathematical 


average. 
[B.C.A. (Kanpur) 2005, 2010] 


The heights of 10 students are given below: 


wo BBB BEEE 


nee cosy [e e o a e vrs fori 


Find the mean and median height. 


The annual death-rates of a country for the last 14 years are given below: 


Determine mode. 


ner 
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22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


Calculate arithmetic mean, median and mode of the following frequency 


distribution of the size of 5 households: 


a 
Number of Households 2 


a 


Find mean, median and mode from the following distribution: 


ne [oo a |e [nr [oso] a [oc 
reels [efefule[s[? 
Find mean, median and mode from the following distribution: 

ns [va f roosa osoa Ton 
morm) e f= | e e | we 


[Hint: Convert the given distribution into a simple distribution] 


Find median and quartiles from the following distribution: 


[Classes [11 15] 16—20 | 21—25 | 26—30 | 31—35 | 36—40 | 41—45 | 46—50 
revere] 7 | o | o || e 


[Hint: Obtain class boundaries before computation] 


Determine mean, median and mode from the following table: 


asn | o |o o o fao a o 


Na oswiens |25 [0 | [3 [os [rs | oo e 


[Hint: Convert the given cumulative distribution into a simple one] 


The mean height of 25 male workers in a factory is 61 inches and that of 35 female 
workers in the same factory is 58 inches. Find the average height of all the workers 
in the factory. 


The mean annual salary of all the employees in a factory was = 3000. The mean 
annual salaries paid to male and female employees are ¥ 3200 and Ẹ 2200 
respectively. Determine the percentage of male and female employed in the 
factory. 
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30. The sum of the deviations of a given observation from the A.M. is zero. 


31. Define mean, median and mode with their merits and demerits. Why mean is 
consider as the best average ? 


32. Find the mean, using direct method: 


10-20 | 20-30) 40-50 50- 60 
ao > fs f> fs] > 


33. 


0 — 100 | 100 — 200 | 200 — 300 | 300 — 400 400 - 500 
rw! [> | s |e fe 


34. Find the missing frequencies fj and fọ in the table given below, it is being given 


that the mean of the given frequency distribution is 50. 


35. Find the mean, using assumed-mean method: 


(i) 


10-20 | 20-30 | 30-40 | 40-50 |50- 60 


20-40 | 40-60 | 60-80 |80- 100 | 100- 120 


) 
10-20 | 20-30 | 30-40 | 40-50 |50- 60 


36. Calculate the mean for the following frequency distribution: 


Gii 


paas |o | e | e e e e 


a 
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37. 


38. 


39. 


40. 


41. 


42. 


43. 


Calculate the median for the following data: 


14 


Given below is the number of units of electricity consumed in a work in a certain 
locality. 


Consumption) «951 85-105 |105-125| 125-145 | 145-165 |165-185| 185-205 
(in Units) 
n BREE i 
Consumers 


The median value for the following frequency distribution is 35 and the sum of all 


the frequencies in 170. Using the formula for median, find the missing frequencies. 


Class Interval |0- 10| 10- 20 
res DORE 


Class 10-14 
Interval 
pomen e 


Given below is the frequency distribution of the heights of the students in a school. 


Height (in cm) 160 — 162 | 163-165 | 166-168] 169-171 | 172-174 


Find the average height of the maximum no. of students. 


Mode of the following series is 36. Find the missing frequency in it. 


Class Interval]0- 10| 10-20 | 20-30] 30-40 | 40-50 | 50-60 | 60-70 
o DAA 
Calculate the mode of the following data: 


Monthly Salary (in 7) No. of Employees 
0 5,000 


isoo |g 
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Answers 


i) M=8.5,Q; =7.5,Q3=9 (ii) M=207. 


ISAM =7 2.395 


— |= | 


— 


4. | 40 and 39 
. | 22 Marks 
ma (a) 19% (b) 3.11% 


— |= |= 
|N“ 


(i) 12 km. p.h. (ii) 40 miles per hours (ii) 23.4 km.p.h. 
. | Mean = 164 cm. Median = 164 cm 


. | Mean = Median = Mode =35 


Q; =60 inches, median = 61 inches,Q 3 =63 inches, P39 = 61 inches 


N 
oS 


De = 62 inches 


. |Mean = 60.55, Median = 59.68, Mode = 57.50 
. |Mean = % 347, Median =% 356.67, Mode = 354.54 


26. | Median =31.71, Q1 =25.93, Q3 =36.81 
Mean = 49.58, Median =58.33, Mode = 67.00 


80% Male, 20% Female 
fi =28 fy =24. 
(i) 28 w as C K 
35.33 
fi =35, fy =25 

goa 


nN 


NJN NINJIN 
vi | |= 


Chapter 


Measures of 
Dispersion 


5.1 Dispersion 
[B.C.A. (Meerut) 2010, 2009] 
Dispersion is the measure of the variation of the items. 


Or 


A measure of variation or dispersion describes the degree of scatter shown by 
observations and it usually measured as an average deviation about control value or by an 


order statistics. 


5.1.1 Objective of Measuring Variability or Dispersion 
[Kashi Vidhyapeeth 2011; B.C.A. (Meerut) 2010] 


Measures of variation or dispersion are computed to serve the following objectives: 


(i) For judging the reliability of averages: Measures of dispersion help us to study the 


extent or degree to which the data are scattered around its average on central value. 
(ii) Comparison of distributions. 
(iii) Useful for controlling variability. 


(iv) Useful in further statistical analysis. 
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5.1.2 Properties of a Good Measures of Dispersion 
[B.C.A. (Meerut) 2010] 


Since a measures of dispersion is the average of second order, the properties of a good 
measure of dispersion are the same as those for a good measure of central value as studied 
in the last chapter, i.e. these are: 


(i) It should be simple to calculate and easy to understand. 
(ii) It should be rigidly defined. 

(iii) Its computation is based on all the observations. 

(iv) It should be capable of further algebraic treatment. 

(v) It should not be affected by extreme items. 


(vi) It should be least affected by sampling fluctuations. 


5.1.3 Two Types of Measures of Dispersion 
(i) | Absolute Measures of Dispersion 
(ii) | Relative Measures of Dispersion 


(i) Absolute Measure of Dispersion: Absolute Measures of Dispersion are expressed 
in the same unit in which the observations are given. Thus, absolute measures of 
dispersion are useful for comparing variation in two or more distributions where 
units of measurement are same. Such measures are not suitable for comparing the 


variability of the distributions expressed in different units of measurement. 


(ii) Relative Measures of Dispersion: Relative Measures of Dispersion are expressed as 
ratio or percentage or the coefficient of the absolute measures of dispersion. In this way, 
relative measures are pure unit less number and are generally called coefficient of 
dispersion. Relative measures are useful for comparing variability in two or more 


distributions where units of measurement may be different. 


5.2 Important Measures of Dispersion 
[B.C.A. (Meerut) 2010] 


Following are some important measures of dispersion: 
l. Range 
Interquartile Range and Quartile Deviation 


Mean Deviation or Average Deviation 


He, oe oS 


Standard Deviation 


a 
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l. Range: Range is the simplest measures of dispersion. For a given set of observations, 
the range is the difference between the largest and the smallest observation. Thus 


Range; R=L-S (1) 
Here, L = the largest observation. 

S =the smallest observation. 

R = the Range. 


In case of grouped data, the range is defined as the difference between the upper limit of the 
highest class and the lower limit of the smallest class. 


Coefficient of Range 


Range is an absolute measure of dispersion which is unsuitable for comparing variation in 
two or more distributions expressed in different units. To over come this difficulty, a 
relative measure of dispersion called the coefficient of range is calculated by the 
following formula: 

L-S 


Coefficient of R: = —— (2 
oefficient of Range Ts (2) 


Solved Examples 


Example 1: Marks of 10 students in Mathematics and Statistics are given below: 


ir a 


(i) | Compare the range of marks in the two subjects. 


(ii) | Compare the coefficients of range for both the subjects. 


Solution: The lowest and the highest marks in mathematics are 10 and 45 respectively. 
Also the lowest and highest marks in statistics are 18 and 42 respectively. 


Thus, the range of marks in mathematics 
R=L-S=45 -10=35 
Range of marks in statistics 
R=L-S=42 -18=24 
L-S 45-10 35 


Coefficient of range (for Mathematics) = =0.64 
L+S 45410 55 


Coefficient of range (for Statistics) = bes oe 24- 40 
L+S 42+18 60 
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Thus, the range as well as the coefficient of range for marks in mathematics are higher 
than that of marks in statistics. 


Example 2: Find the range of the following distribution of families according to its size (No. of 
children). 


Size of the Family 


No. of Families 


Solution: In this example the variable (X) is the size of the family (No. of children). The 
minimum and the maximum size being O and 5 respectively. Thus, the range 
R=L-S=5-0=5 


Example 3: Find the range and coefficient of range from the following data: 


Mid Value 


Solution: Here, the given mid-values may be used to define class boundaries or true 


limits of various classes. Since the uniform class-interval is 5, thus the limits for the first 
class will be (5 —2.5,5 + 2.5), i.e. (2.5 —7.5), that of second will be (7.5 —12.5). The 
limits for the remaining class are shown in the table. 


Thus the range 


R = upper limit of the highest class-lower limit of the lowest class. 


=37.5 -2.5 =35.0 


Computation of Class Limits 


mss [oe 
ses [9 
srs | o 


L-S 


Also, coefficient of range = 
L+S 
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37.5 -2.5 
-37.5 42.5 
35.0 
~ 40.0 
= 0.875 


Example 4: Find the range and coefficient of range from the following distribution: 


Fen [7 | e | s | s ea | 


Solution: First convert the given inclusive class limits into continuous classes (exclusive 
classes) as shown in the table: 


Computing Continuous Class Limits 


[sis dd 


Here, L=40.5 and S=5.5 
R =40.5 -5.5 =35 
L-S 


Coefficient of range = 


L+S 


_40.5-5.5 
40.5 +5.5 


= 22 =0.76 
46 
Example 5: The smallest value in a set of observations is 20 with a range of 25. Find the largest 
observation and the coefficient of range. 


Solution: Given that, R =25, S =20. To find the largest observation L, we consider 
R=L-S or 25- L=20 
or L=25 +20 =45 


L-S 45-20 25_ 


= =— =0.37 
LS 454+20 65 


Coefficient of Range = 


eS 
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Merits of Range 


(i) 
(ii) 


It is simple to understand. 


It is easy to calculate and provides a broad picture of the scatteredness in data 


quickly. 


Demerits of Range 


(i) 
(ii) 


Its composition is not based on all the observations. 


It is affected by extreme items. A single extreme value, either the maximum or 
minimum can affect the value of range. 


It is very much influenced by sampling fluctuations. 


Range is a crude measure of dispersion. It does not tell us about the variation in the 
observations relative to the average. 


Range can not be calculated for open end distributions. 


Interquartile Range and Quartile Deviation: The range is based on two extreme 
items and it does not take into account the variation within the range for this 
reason interquartile range is defined. 


Interquartile Range includes the middle fifty percent of the Distribution : In other 
words, interquartile range is the difference between the third quartile (Q3) and the first 


quartile (Q) i.e. 
Interquartile range = Q -Q ald) 


In some case quartile deviation (Q.D.) or semi-interquartile range is defined as the 
average amount by which the two quartiles differ from the median therefore 


Quartile deviation or Semi-interquartile range 


_3-Q 
- = (4) 


The Coefficient of quartile 


Coefficient of Q.D. = ooo ail) 


Example 6: From the following information of wages of 15 workers, find interquartile range, 
quartile deviation and co-efficient of Q.D. 


Solution: Arrange the given wages in the ascending order as under 400, 420, 440, 440, 
450, 470, 480, 480, 490, 500, 520, 550, 580, 620, 680. 
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N+1 
Q is the size of Í =) lo 


=— =4th item 
4 
Q = 440 
Q; is the size of 3(N +) = 3(15 +) 
G 4 
Q =7550 
Interquartile - Range = @ - Q = 550- 440=7 110 


Q@ -Q _ 550- 440 
2 2 


-Q-Q _ 550-440 110 o 


=12th item 


Quartile deviation (Q.D.)= =%55 


Coefficient of Q.D. 


~~ Q+Q 550+ 440 990 ` 


Example 7: Scores are 


Find the quartile deviation. [B.C.A. (Meerut) 2004] 


Solution: Arranging given data in ascending order. This is a case of inclusive series. 
Converting it into exclusive series. 


ee ae 
ee 
Don S S 
Coe | ed 
Passos | sd 
pss) eo 
Tasca | 2d 
ssas | > 
Cas” | a 
a 


ana rea 
0 


9 
5 
7 
6 
2 
3 
2 
=50 
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Q. No. = (5) th item 


= =12.5 th term which lies in class-interval 9.5 -14.5 


Q;.No = (=) th item=37.5th lies in 29.5 -34.5 


ante) 


14.5-9.5 
Q =9.5 +P 125 -9) 


Q =9.54+3.5 =13 


= 2G 
Q =29.5 +e? 675 —37)=29.917 


Quartile deviation = =8.4585 


(Q3 -Q) _ (29.917 -13) 
To n oe 


Example 8: Calculate Q.D. and its co efficient from the following distribution: 


mno |s| oja eje] eefe 


meowows 1? ts bolsols[s[s > 


Solution: 


Computing Q; and Q3 


Taser [wae 


Computation of Q : 


Q is the size of + = aot) 


=12.5th item. 


Q =%61 
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Computation of Qs: 


N+l 


Q is the size of 3 í ) =37.5th item. 


Q3 =763 
op.-2 -Q _ 63-61 
2 2 
=71.0 


Coefficient of Q.D. = B=. = ala = Ez =0.016 
@ +Q 634+61 124 


Example 9: The following is the age distribution of 799 workers: 


Age-Group 20-25 | 25—30 | 30—35 | 35—40 | 40—45 | 45—50 | 50—55 | 55—60 
70 


Find (i) Q.D. (ii) Coefficient of Q.D. 


Solution: For determining Q.D. and its co efficient value, we first find the values of Q 
and Q from the given distribution. 


Computation of Q: 


Q is the value of > = ines =199.75th item. Thus, 30 — 35 is the J quartile class. 


4 


Computation Q and Q; 


Using the interpolation formula 
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199.75 -120 
+ —— x5 


=304 
100 
-30 4 398.75 
100 
=30 +3.99 =33.99 =34 years. 


Computation of Q; : 


7 
Qs is the value of — = : a = =599 .25 th item which lies in 45 — 50 class. Using the 
interpolation formula. 

3N 
4 
Q =) + xh 
J 
-454 599.25 -550 <5 =454 5x49.25 
120 120 


=45 +2.05 =47.05 =47 years. 


-Q _47-34 13 


_Q 
ae 2 2 2 


=6.5 years. 


- 47 -34 
and coefficient of Q.D. = BW Q = = 13 =0.16 
Q@ +Q 47+34 81 


Merits of Q.D. 

(i) Itis simple to calculate. 

(ii) It is easy to understand. 

(iii) It is useful for measuring variations in open-end distributions. 
( 


iv) Itis also very useful when extreme items are likely to affect the analysis. 


Demerits of Q.D. 
(i) The calculation of Q.D. is not based on all observations. 
(ii) It is not possible to give it further algebraic treatment. 


(iii) It is very much affected by sampling fluctuations. 


3. Mean Deviation or Average Deviation: Mean deviation of a series is the arithmetic 
mean of the absolute deviations of various items from some central value, such as mean, 
median or mode. Since some of the deviations about mean are positive and others 
negative, their algebraic sum about mean would be zero. That is why, is defining 
mean deviation, the average of the absolute [ignoring plus (+) and minus (—) signs] 
deviations from some central value is taken. In general, the central value can be any 
measure of central tendency, but usually the term mean deviation indicates 
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average of absolute deviations taken from mean. Symbolically, the mean deviation 
about mean, median or mode can be expressed as follows: 


(a) Mean deviation from mean X : 
l = 
MDa lX -x| ...(6) 


(b) Mean deviation from median Mj : 


1 
M.D.m, = 35 21 -Mal (7) 


(c) Mean deviation from mode Mg: 


l 
M.D.My = 57 21 X -Mol (8) 


Here, N = the number of observations. 
Z| X — X |= Sum of absolute deviations taken from mean. 
x| X - Mı |= Sum of absolute deviations taken from median. 
2| X - Mo | = Sum of absolute deviations taken from mode. 


In case of grouped data or frequency distributions, the above formulae can 


be put as: 
M.D.y=57IX-X| (9) 
l 
MD.m, =ġ 1X - Mal (10) 
l 
M.Dm, =77 1X -Mol (11) 


Coefficient of Mean Deviation 


Mean deviation is an absolute measure of dispersion. The corresponding relative 
measure called coefficient of mean deviation is obtained by dividing mean deviation by the 
average or central value used for calculating it. Thus, symbolically, 

M.D. 


Coefficient of M. D. = ——————— (L2 
Mean or Median or Mode 


In computing coefficient of M.D., it is important to note that the average used in calculating M.D. 
should be used in the denominator of formula (12). This relative measure of dispersion is useful 
for comparing two or more series which are expressed in different units or their averages 
are widely different. 
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Computation of Mean Deviation 


l. Individual Series 


Example 10: Compute mean deviation and its coefficient from the following data relating to the 
marks obtained by a batch of 11 students in a class test: 


pao [e e e e e e e e 


Solution: 


Computation of M.D. (Individual Series) 


Here, | -283 53 
N TI 
M.D -L sjx-x| 
N 
TOn 17:27 
ll 


Also, on using formula (12), the coefficient of mean deviation is 


_MD. _ 17.27 
“Mean 53 


— =0.326 
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Steps 

(i) Find Mean of the Series. 

(ii) Obtain absolute deviations of observations from mean, i.e., | X — X| 
(iii) Obtain the sum of absolute deviations, i.e. Z| X — X | 

(iv) Use formula (6) to get M.D. 


2; Discrete Series: 


Example 11: Calculate mean deviation from median for the following data. Also compute the 
coefficient of M.D.: 


Solution: First we determine median of the given discrete distribution. Median is the 


N+ 20+1 
value of (==>) item = — =10.5 th item. 


Median = M} =8 


Using formula (10), the M.D. from M, is 


l 
M.D.=— f| X- My| 
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Computation of M.D. from Median 


Using formula (12), 


The coefficient of M.D. = M = 28 =0.35 


3: Continuous Series 


Example 12: Compute the mean deviation (M.D. ) from the following data. 


20—40 | 40—60 60—80 | 80—100 | 100—120 
of a| efo e] 


Also find the coefficient of M.D. 


Solution: 


Computation of M.D. from Mean 


ooo] e 
O T p 


Measures of Dispersion 181 


X=A+th = =50 +20 4° ~50+1.07=51.07 
N 187 


M.D. ee ey enc y 
N 187 


M.D. 14.42 _ 


Coefficient of M.D. = —— 
Mean 51.07 


0.28 


Merits of M.D. 


It is simple to understand. 
It is easy to calculate. 
Its computation is based on all the observations. 


It is not affected by extreme observations. 


Demerits of M.D. 


In the calculation of mean deviation, the algebraic signs of the deviations are 
ignored therefore its definition is non-algebraic. In view of this, mean deviation is 
not used in further statistical calculations. 


If is not a well defined measure as any of the central value can be used in its 
computation. More so, the mean deviation calculated from different averages 
(mean, median or mode) will not be the same. 


Standard Deviation: The concept of standard deviation was first introduced by 
Karl Pearson. It is one of the most popular and important measure of dispersion. It 
satisfies most of the properties of a good measure of dispersion. The standard 
deviation is defined as the positive square root of the arithmetic mean of the squares of 
deviations of the observations from the arithmetic mean. It is also known as ‘root mean 
square deviation’ and is generally denoted by the small Greek letter o (called Sigma). 
Symbolically for ungrouped data or individual series, the standard deviation is 
defined as 
l 


S.D.=0= Bx XP | ..(13) 


In the case of a frequency distribution, the formula becomes 


o= R Xf x -xX | (14) 


The square of the standard deviation is known as variance. Therefore, variance for an 
individual series can be calculated by using the following formula: 


Variance = 0° =>- (15) 


In case of grouped data or frequency distribution, the formula for calculating 
variance will be 


Variance = 0° =~ xf (X axy ... (16) 
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Computation of Standard Deviation (S.D.) in Individual Series or Ungrouped. 


Data 


For ungrouped data, the following two methods are used for computing standard 


deviation: 


(i) 
(ii) 
(i) 


Direct Method 


Short-cut Method 


Direct Method: In this procedure, the following formula is used for calculating 
S.D. (0): 


=, fl _ x2 
o= [Zz xe] (17) 


o -da-e [E-e] -o 


Theorem: Show that for only discrete distribution their S.D. is not less than the mean 


deviation from the mean. 


[B.C.A. (Meerut) 2004, 2002, 2001] 


Proof: With usual notation we have 


=> 


SE f- M? >f EE flx- MI} M = mean 


wo (r= M? all zfix-mI} 
NE f E > (2fé; where &; =| x- M| 
pth tat ih Sf + t+ ha 
> (fill + faba +-+ faba)” 
AAS +83 +...22 fife bio 
ARÉ -62 20 


Being the sum of perfect squares. L.H.S. is never negative. Hence, the proposition. 


- 
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Example 13: Calculate S.D. from the following set of observations: 


X :10,11,17,25,7,13,21,10,12,14 


Solution: 


Computing S.D. 


Gi) 


Short-Cut Method: In most of the cases, the arithmetic mean of the given 
distribution happens to be a fractional value and then the process of taking 
deviation and squaring them becomes quite tedious and time consuming in the 
computation of S.D. To overcome this difficulty, short-cut method of 
computation is used which involves deviations from assumed mean. The short-cut 
formula for calculating S.D. is: 


2 2 
S.D.=0= = (z2) (19) 
N N 
Here, d = deviation from assumed mean, say A; i.e. d = (X — A) 


Xd =the sum of deviations 


Yd? = the sum of squares of deviations 


N =the number of observations. 
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Example 14: Find S.D. from the data in example 1 


Solution: 


Computation of S.D. 


Let, 


Using formula (19), i.e. 


Computation of S.D. in Grouped Data 
Any of the following three procedures may be applied to find S.D. for grouped data : 


1. Direct Method 
2. Short-Cut Method 
3. Step-Deviation Method 


xe 
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l. Direct Method: In direct method of computing S.D., the following formula is 


S.D.=0= E Xf x -X| ...(20) 


where symbols have their usual meaning. 


used: 


Example 15: Use direct method to calculate S.D. of the following discrete frequency distribution: 


p-o | 4 | 5s | 6 | 7 | 8 | 9 | v0 


Solution: Computation of S.D. (Direct-Method) 


Frequency f 


4 
5 
6 
7 
9 
0 


1 


Here, we first calculate: 


12 
15 
28 
20 
14 
5 
N= 


Using formula (20), 


237.64 
: I re |= V376 -1.54 


2: Short-Cut Method: In the direct method of computing S.D., we face a difficult 
situation when the arithmetic mean is not a whole number. To avoid such difficulties, 
we consider the deviations of observations from a suitably chosen assumed mean, say 
A. In short-cut method of computing the S.D. we make use of the following 


S.D.=6= -E (21) 


formula: 


N N 


where d is deviation from assumed mean. The short-cut method of calculating S.D. 
involves the following steps. 
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Example 16: Use short-cut method to find the S.D. of the data in example 3. 


Computation of S.D. (Short-Cut Method) 


SDs TST 


pee eae 


Solution: 


Using formula (21), 


ae. (4) 5- £ z 
5- a 


=y [2.38 - 0.0036] = 2.3764 =1.54 


3. Step-Deviation Method: In short-cut method of calculation our main aim is to 
simplify the deviations so that computations become easier. In grouped data, specially 
in continuous frequency distributions, we observe that the calculations can be further 
simplified if deviations (d) are divided by a common factor, say h, which is usually the 
size of the class-interval to get the step-deviations as d’. However, such a division is 
then reflected in the formula for computing S.D. which now becomes: 


sah KA (22) 


Steps involved in computing S.D. by using step-deviation method will be clear 
from the following example: 


Example 17: Use short-cut and step-deviation method for calculating S.D. of the following 
distribution. 


Age Groups (years) 25—30 | 30-35 | 35—40 | 40—45 | 45—50 | 50-55 
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Solution: Using Short-Cut Method. 


Computation of S.D. (Short-Cut Method) 


iS Mid d=(X 42.5) 
= ae 


| 25-30 | TE Fa 5 — eE — e) | 2250 | 


seas] w | as | o | o fof o | 
45-50 eel ee | ea 


[50-55 | F E 5 


The assumed mean is taken as A =42 5. Using formula (22). 
ae me 2 
zf (zaf | _ || 4625 (315 
N \N) | y] 100° | 100 


= [[4625 — 99225] = [363275] =6.03 years. 


Using Step-Deviation Method 
Computation of S.D. (Step-Deviation Method) 


ae ee fae 


= | 25-30 | ea 5 = | =28 


| 50-55 | e 5 


Here, A=42.5 andh=5. Using formula (22), i.e. 


gi — (4) 
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Putting the values from the table, 


185 (Oy E 
E H5- (G) | =5 [1.85 -0.3969)] 


=5 V (1.4531) =5 x 1.2054 =6.03 years. 


which is the same as that obtained by short-cut method. 


Example 18: From the following information, find the S.D. 


N =10, =X =60, £X? =1000 
[B.C.A. (Rohilkhand) 2006] 


= 7 (=) ee (2) 
o 


[100 -36] =8.00 


Solution: Using the formula (18), 


Example 19: Use the following information to find the variance. 
N =20, X =50, £X? =84000 [B.C.A. (Agra) 2004] 


Solution: Using formula (18), i.e. 


Squaring on both sides: 


2 
Variance = 0° = — — a 
N 


Putting the given values, one gets 


S 84000 
20 


- (50)? = 4200 -2500 =1700.00 


Example 20: Find the value of S.D. and the variance from the following information: 
E(X -7) =8, 5 (X -7 =535, N=15 


Solution: For computing the S.D. or the variance, we need the values of N, XX and rx? 
which can be gathered from the given information. 

Since E(X -7) =7 

or xX — 27 =7 
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or EX -15 x7=7 [Using Xc = Ne] 
EX =105 +7=112 il) 
Again E(X -7 =535 
or z (x? -14X + 49) =535 [Using £ cX =cXX] 
or EX? -145X + X49 =535 
or EX? -14 x112 +15 x49 =535 
EX? =535 +14 x112 -15 x49 =1368. re) 


Using (1) and (2) the variance 
ot =| EX. (2X) |_ LA 
“| N NJ} ip |15 15 
=91.20 -55.75 =35.45 


S.D. =o (35 45) =5.95 


Correcting Incorrect Values of Mean and Standard Deviation 


Important Remarks: 
(i) EX can be obtained if N and X are given. 
(ii) EX? can be computed if N, X and © are mentioned. 


(iii) It was observed in the last chapter that for correcting the incorrect mean value, we need to find 
the corrected XX and XfX. For this we subtract the wrong items from the incorrect XX or XfX 
and add to it the correct items. 


(iv) For correcting the incorrect value of S.D. or variance, we need the corrected values of XX and 
EX? both. Corrected ZX? (or XfX ) is obtained by subtracting the squares of the wrong items 


from the incorrect =x (or xx? ) and adding the squares of the correct items. 
The following example will clarify these points. 


Example 21: The mean of 100 items is 50 and their S.D. is 4. Find the sum and the sum of 
square of the items. 


Solution: 
(i) | Computing sum of the items, i.e. ZX: 
Given that N =100 and X =50 


The sum of the items = EX = N X =100 x50 =5000 
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(ii) Computing sum of squares of the items, i.e., xX? : 


Given that N =100, X =50, £X =5000 and o =4. 


2 2 
Also knowing that S.D. = © = a = (=) | 


Putting the given values, we have 


Squaring on both sides, 


2 
16= a — 2500 
100 
2 
or 2500 +16 =>% 
100 
or IX? =100 x [2516] =251600 


Example 22: The sum of squares of 20 observations was worked out as 5100. But while 
calculating it, an observation 31 was wrongly considered as 13. Find the corrected sum of squares. 


Solution: Given that incorrect sum of squares, i.e. Ex? =5100, Now, the corrected 
EX? = incorrect EX* — (Square of the wrong item) + (Square of the correct item) 
=5100 - (13)? + (31)? =5100 -169 + 961=5892 . 


Example 23: The sum of squares of 100 observations was calculated as 7961. Later, it was found 
that two values, 53 and 42 were wrongly read as 35 and 24 at the time of calculation. Find the 
corrected sum of squares. 


Solution: Given that incorrect EX2 =7961. 


Corrected XX? = incorrect EX? — (Squares of wrong observations) 


+ (Squares of correct observations) 
Corrected EX? =7961 - (352 + 24”) + (532 + 427) 


=7961 -(1225 + 576) + (2809 + 1764) =10733 


Example 24: The sum of squares of 50 observations is 4122. An observation 39 was wrongly 
included in the series. Find the sum of squares of the remaining 49 observations. 


Solution: Incorrect EX? = 4122. 
Corrected EX? = incorrect EX? — (Squares of the wrongly taken observation) 


= 4122 — (392) =4122 -1521=2601 
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Example 25: The arithmetic mean and the S.D. of a series of 20 items were calculated as 20 cm. 
and 5 cm. respectively. But while calculating them, an item 13 was measured as 30. Find the correct 


arithmetic mean and standard deviation. [B.C.A. (Meerut) 2008, 2006] 


Solution: Given that N =20, X =20, o=5, incorrect observation=30, correct 


observation= 13. 
Thus, in corrected EX = NX =20 x20 =400 
Corrected XX = incorrected £X — (incorrect observation) + (correct observation) 


=400 -20 +31=411 


C ted £X 
Corrected Mean = ee all =20.55 sell) 
N 20 


For finding the corrected S.D., we first find the incorrect IX? from the given X ando 


values by using the formula 


@ [BE (By | Bee 


or EX? =N (0 + X?) 
incorrect EX? =N (0% + X?) =20 (25 + 400) = 8500 


Thus, corrected XX? = incorrect XX? — (Squares of incorrect values) + (Squares of 


correct values) 


=8500 -307 + 137 = 8500 -900 + 169 =7769 ...(2) 
Using (1) and (2), the corrected S.D. becomes 


Corrected S.D. = joes! rx? 7 — =) 


N N 


2 
_ || 7769 _ a = [388.45 — 422.30 
20 20 
= (21.73) =4.66 


Example 26: The mean and S.D. of 20 items are found to be 10 and 2 respectively. At the time of 
checking, it was observed that one item 8 was incorrect. Find the mean and the S.D. if (i) the wrong 
item is omitted (ii) it is replaced by 12. 


Solution: Given that N =20, X =10,0 =2 and wrong item = 8. 


Thus, incorrect ZX = NX =20 x10 =200 
incorrect EX? =N (0° + X?) =20 (4 + 100) =2080 
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Case (i) when the wrong item is omitted 
Corrected N = incorrect N -1=20 -1=19 
Corrected £X = incorrect LX — wrong item =200 -8 =192 
Corrected £X? = incorrect ZX? - (Square of the wrong item) 
=2080 -8° =2080 -64 =2016 


Corrected Mean = Corrected 24 2 ee =10.11 
corrected N 19 


corrected £ X2 corrected EX z 
Corrected S.D. = am = ea | 


corrected N corrected N 


a (2) - {106.11 -10212] 


(3.99) =1.99 ~2.00 


Case (ii) when the wrong item is replaced by 12. 


Corrected £X = incorrect £X — (Wrong item) + (Correct item) 


=200 -8+12 =204 


Corrected X = one 2A 204 -10.2 
N 20 


Corrected £X? = incorrected ZX” — (Square of wrong item) 


+ (Square of correct item) 


=2080 -64 + 144 =2160 


Thus, corrected S.D. = 
N N 


corrected ZX? S — x) | 


ae (=) |- J108 -104 04] 
20 
[3.96] =1.99 


Algebraic Properties of Standard Deviation 


We observed that if the means and the number of items in different groups are given, the combined 
mean of all the groups can be computed. Similarly, combined standard deviation can be calculated if 
the standard deviations, means and number of items in different groups are given. The formula used 
for computing combined standard deviation is as under: 


l. 


Measures of Dispersion 195 


Combined S.D.: 


(i) For two Related Groups: 


ae ~x (o? + Df) + No (03 = (23) 
Ni + No 
where, 60,9 = Combined S.D. of two groups 
oq = Standard deviation of the first group 
© = Standard deviation of the second group 
N; = No. of observations in the first group. 
No» = No. of observations in the second group. 
Dı = (X; - X12), X12 = combined mean of the two groups. 
Dy = (Xz - X12). 
X = mean of the first group. 
Xə = mean of the second group. 
(ii) For three Related Groups: 
ee > (of + Dy) + No (03 + D5) + N3 (03 + i oa) 
(Ni + No + N3) 


Here, 6193 = Combined standard deviation of three groups. 
Dı =(X, - X123), D2 = (Xo - X123) 
D3 =(X3 — X123), X123 being the combined mean of the three 
groups. 
Other symbols have their usual meanings. The formula for combined S.D. can 


easily be extended to any number of groups. 


Standard Deviation and Normal Curve: For a normal distribution or a 
symmetrical distribution, the number of observations lying within specific ranges 
can be measured with a greater precision. In such distributions the percentage of 
observations lying in some specific ranges are as under: 


(i) X +0,also called 1-sigma limits, covers 68.27% of the total items. 

(ii) X +20, also called 2-sigma limits, covers 95.45% of the total items. 

(iii) X 30, also called 3-sigma limits, covers 99.73% of the total items. 
Empirical Relationship between Measures of Dispersion: For a normal 


distribution or for a moderately a symmetrical distribution, following empirical 
relationships exist between various measure of dispersion. 


(i) Quartile Deviation (Q.D.) = S.D. 


_—— 
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(ii) Mean Deviation (M.D.) = (5) S.D. 


(iii) Quartile Deviation (Q.D.) = (z] M.D. 


(iv) 6 S.D.=9 Q.D. =75 M.D. 
Example 27: A sample of 35 values has mean 80 and S.D. 4. A second sample of 65 values from 


the same population has mean 70 and S.D. 3. Find the mean and standard deviation of the 
combined sample of 100 values. 


Solution: Given that Nj 35, X 80, o, =4; 


No =65, X =70, 69 =3 


Combined Mean: 
Xo _ NX + No Xp _ 35 x80 +65 x70 _ 2800 + 4550 TT 
Ni + No 100 100 
Combined S.D. : 
D, =(X - X19) =(80 -73.5)=+6.5 +. DÊ =42.25 
Dy =(X» - Xj) =(70-73.5)=-3.5 +. D =12.25 
aali (of + DÊ) + No (03 + D5) 
i Ni + N3 
35 (16 + 42.25) + 65 (9 +12.25)]_ | [2038.75 + 1381.25 
35 +65 B 100 


= [34.20] =5.85 


Example 28: Find the mean and standard deviation of the two groups taken together: 


o | w» | w» |» 


Solution: Given that 


N, =113, X =160, 6) =22; Nə =120, X =150, o3 =20. 
Combined Mean : 
a NX, +NəXə _ 113 x160 +120 x150 _ 18080 + 18000 
mS NaN 113 +120 7 233 


oe 154.85 


T 
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Combined S.D.: 
D, = (X - Xo) = (160 -154.85)=5.15 ». D? =26.5225 


Dy = (Xp - X19) =(150 -154.85) =-4.85 +. DŻ =23.5225 


oo = [|D (0È + Pt) + No (03 + D3) 
12 Ni + No 


_ | [113 (484 + 26.5225) + 120 (400 + 23.5225) 
z 113+120 


_ [[57689.04 + 50822.70 
g 233 


|- [465.72] =21.58 


Example 29: The following data were recorded in two sample studies: 


100 


Calculate the combined mean and S.D. of the observations in the two sample together. 
[B.C.A. (Meerut) 2002] 


Solution: The following data are given for the two samples: 


Combined Mean : 
Fe NX, + NoX_ _ 150 x40+100 x55 _ 11500 


12 —______— =~ = 46.00 
Ni + Ny 150 +100 250 


For combined S.D., we first compute— 


D, =(X, - X19) =40 -46 =-6.0 ~. D? =36.0 


Dy =(X% - X12) =55 -46=9.0 +. D =81.0 


Putting all these values in the formula for combined S.D., i.e. 


sro = | | NLCot + Pt) + No (03 + D3) 
i N, + Np 


S — a 


]-9.14 
150 +100 250 


—— 
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Example 30: The mean of the two samples of sizes 50 and 100 respectively is 54.1 and 50.3 and 
standard deviations are 8 and 7. Obtain the mean and standard deviation of combined sample of 


size 150. [B.C.A. (Meerut) 2001] 


Solution: Given 
N; =50, No =100 
X, =54.1, X% =50.3 
©] =8, © =7 
Ni X +N X% _50x54.1+100 x50.3 


Combined Mean (Xj9) = = 
7 N + No 150 


=51.57 


Combined Standard Deviation 


Xi)" + No (Xp - X2) 
Ny + No 


S y50 x64 +100 x49 + 50 x6.4 +100 x 1.6129 


150 


3200 +4900 +320 +161.20 


E 150 


Ni of + N3 o2 + N; (X - 
O19 SO >>Rr«qOqWR—«B§>S 20 


61) = 7.564 


Example 31: The following data were recorded for three distributions: 


Obtain combined mean and S.D. of all the three distributions together. 


Solution: For the three distributions, the data can be put as: 


Combined Mean: 


Xo, -MX + NoXy + N3X; _ 20 x60 +50 x50 + 60 x40 
123 = NS N3 + N3 20 + 50 + 60 
_ 6100 


—— =46.92 
130 
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For combined S.D., we first calculate: 
D, =(X, - X23) =60 -46.92 =13.08 ~. DÊ =171.09 
Dy =(X - X23) =50 - 46.92 =3.08 ». Dz =9.49 
D3 =(X3 - X23) =40 -46.92 =-6.92 ». D2 =47.89 
Also given 6; =8, 69 =12 and og =20. Thus, the combined S.D. will be 


s S S 9 
AFE Ni (of + Dp) + Na (63 + D3) + N3 (03 + D3) 
a Ni + No +N3 


_ | [20 (64 + 171.09) + 50 (144 + 9.49) + 60 (400 + 47.89) 
g 20+50+60 


B —— +7674.50 + 26873 2 


— 
130 


130 


= [801.92] =17.38 


Merits of S.D. 
(i) Itis rigidly defined. 
(ii) Its computation is based on all the observations. 


(iii) It is amenable to further algebraic treatment which makes it the most important 
and widely used measure of dispersion. For example, the S.D. is used in computing 


skewness, correlation etc. It is an important statistical measure in sampling theory. 
(iv) Among all the measures of dispersion, it is least affected by sampling fluctuations. 


(v) S.D. enables us to determine the reliability of means of two or more series having 
equal means. In such a situation, a series having minimum S.D. will have the most 
representative mean. In other words, a smaller value of S.D. reflect greater 
compactness and smaller variability among items. 


Demerits of S.D. 
(i) S.D. is comparatively difficult to calculate. 
(ii) It gives greater weight to extreme observations. 


(iii) It is an absolute measure of dispersion and cannot be used for comparing 


variability of two or more distributions expressed in different units. 


~~m 
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Coefficient of Variation (C.V.) 


It is now clear that the standard deviation, as a measure of dispersion, gives us an idea 
about the extent to which observations are scattered around their mean. Thus, two or more 
distributions having the same mean can be compared directly for their variability with the help of 
corresponding standard deviations. Now the following two situations may arise: 


(a) When two or more distributions having unequal means are to be compared in respect of their 
variability. 


(b) | When two or more distributions having observations expressed in different units 
of measurements are to be compared in respect of their scatteredness or variability. 


For making comparisons in the above two situations, we use a relative measure of dispersion, 
called coefficient of variation (C.V.) The coefficient of variation (C.V.) is 


defined as: 
CVs x100 (25) 
X 


Example 32: A sample of 5 items was taken from the output of a factory. The length and weight of 
5 items are given below: 


ooro a a 


State which of the two characteristics of the two items is more variable. 
[B.C.A. (Garhwal) 2007] 


Solution: The two characteristics of the items, are length and weight expressed in 
inches and ounces respectively. Since the units of the two characteristics are different, 
their variability can be compared with the help of coefficients of variation which is 
computed as under: 


Computation of Coefficient of Variation 


Length (inches) Weight (ounces) 
dy 


Y 


On comparing the two C.V.'s we can say that the length characteristic is more variable 


than weight. 
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= | [68-064] = J [614] = /[7.8-l] =, 6.8) 


Co-efficient of variation Coefficient of variation 


248 2.61 
CVN O 100 CY -SLN 
= 780 = 17.00 ~ 


=31.79% =15.35% 


On comparing the two C.V. and we can say that the length characteristic is more variable 
than weight. 


Example 33: Below are the number of runs scored by two batsmen in eight matches. 


Foo BE 


Who is better run scorer ? Also find which of the two batsman is more consistent in scoring ? 


[B.C.A. (Agra) 2004] 


Solution: Since the average number of runs scored by the two batsmen are different, as 


such the coefficient of variations will be compared for observing consistency. 
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Computation of Coefficient of Variation 


(683.5 —4) = 4 (679.5) = [184575-4] 


=26.07 =, (184175) =42.92 


x 100 Beane 100 
2 


=52.14% = 82.54% 


Comparing the means of runs, we find that the batsman B is better in scoring. But since, 
the C.V. for X-series is less than that of Y-series, thus batsman A is more consistent. 
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Example 34: Compute coefficient of variation for factory A and B from the following: 


bocccs | 5 [10 | 20 | 35 | 2 | 8 _| 


Find: (a) Which factory pays higher average daily wages? 


(b) In which factory, wages are more variable. 


Solution: Computation of Coefficient of Variation 


5 
Ny =145| 224x | Bodx 
=465 =5905 


Sy D a A D ral 
90 145 


= Ẹ 20.67 = 7 20.21 


I (28 j 
Ny 


_ |[5905 (465% 
“Wy 145 145 
= J [85.11-13.44] = J [40.72 —10.30] 


= | (2067) =% 4.66 = J [30.42] =%5.52 


Gy = 100 = 5 a 
= 22.54% =27.31% 
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\ 202 P, 


Elements of Statistics 


(a) Comparing mean wages, we can say that factory A pays higher average daily wages. 


(b) Comparing the C.V.'s, we find that the wages are more variable in factory B. 


Example 35: In the following table, distribution of students is shown according to their weights in 
kg. Find the coefficient for each series. Which series show greater variation in weights ? 


[B.C.A. (Delhi) 2004] 


Computation of Coefficient of Variation 


Here A=45 and h=10 


Meane a a 
N 


=45 O45 les) 
62 


=46.25 kg. 


Nı N, 


=10 J[1.35 —0.0166] 


=10 ¥ [1.3334] =10 x11547 


=11.55 kg. 


Meanie a See 
N 


h (22 i 


Here also A=45 and h=10 


r 
2 


=45 e G +143 
56 


ey 


= 46.43 kg. 


=10x A [1.2143 —0.0204] 


=10 x J 1.1939 =10 x 1.093 


=10.93 kg. 


OTN 
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=X x100 = == x100 


46.43 
=24.57% = 23.54% 


On comparing the coefficients of variation for the two series, we find that class A shows 
greater variation in weights. 


Example 36: The means and standard deviations of two brands of light bulbs are given below. 
Calculate a measure of relative dispersion for the two brands and interpret the result. 


850 hours 750 hours 
S.D. 102 hours 60 hours 


[B.C.A. (Rohilkhand) 2006, 2003] 


Solution: 


Given that Xi = 850 hours Given that X =750 hours 


CVA le O OO CVSO TOO 
K 850 X 750 


Although a comparison of average life leads to the conclusion that Brand -1 is better, yet 


the comparison of C.V.'s shows that Brand —2 has less variability or greater uniformity. 
Thus, brand—2 is supposed to be better as it is more reliable. 


Example 37: In the factories A and B the average weekly wages and standard deviation are as 
follows: 


Average Weekly Wages R) S.D. of Wages No. of Workers 


Ce [| | » | » 


Answer the following: 


(i) | which factory pays larger amount as weekly wage ? 
(ii) | which factory shows greater variability in the distribution of wages ? 


(iii) what is the mean and S.D. of wages of all the workers in two factories taken 
together ? [B.C.A. (Meerut) 2003, 2002] 


ED Elements of Statistics 


Solution: The given data can be put as: 
X, =460, oo, =50, N,=100 
Xə =490, © =40, No =80 
(i) Total weekly wages paid by factory 
A==X =N; X =100 x 460 =% 46,000 
Total weekly wages paid by factory 
B =FEX =N; X, =80 x 490 =%39,200 


.. Factory A pays larger amount as weekly wages. 


(ii) For factory A, the C.V. = 2L x100 = a x 100 =10.87% 
J 


For factory B, the C.V. = 2 x100 = — x100 =8.16% 
2 
The comparison of C.V.'s reveals that factory A is more variable in respect of wage 


distribution. 


(iii) Combined Mean 


7 N,X, +N3X) 460 x100 + 490 x 80 
I =——_ ="... e- 


=% 473.33 
Ni +No 100 + 80 


For combined S.D., we first calculate— 
D, =(X, - X2) =460 - 473.33 =-13.33 +. Dp =177.69 
Dy = (Xp - Xj) =490 - 473.33 =+16.67 «. Ds =277.89 
of =(50)* =2500, 05 =(40)* =1600 

Putting these values in the formula for combined S.D., i.e. 


oo = [|D (0f + Pt) + No (03 + D3) 
12 N, + No 


_ | [100 (2500 + 177.69) + 80 (1600 + 277.89) 
7 100 +80 


267769 +150231.20 
= — [2322.22] =48.19 


180 
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Example 38: Two workers on the same job shows the following results over a long period of time : 


(Ln eae 


Mean time of completing the job (minutes) ee ae 
bers o 


(i) which worker appears to be more consistent in the time he requires to complete the job ? 


(ii) which worker appears to be faster in completing the job ? 
[B.C.A. (Delhi) 2007, 2002] 


Solution: 


(i) Given that X =25, X sta 36, 05 =16. 


For worker A, C.V. = = x100 = L x 100 =24% 
A 25 

For worker B, C.V. = x 100 = 23 x100 = 20% 
a 20 


The comparison of C.V.'s lead to the conclusion that worker B appears more 
consistent in the time needed for completing the job. 


(ii) | Since the mean time taken to complete the job is less for worker B, thus worker B 
appears to be faster in completing the job. 


Example 39: The following distribution shows the life of bulbs manufactured by two companies X 
and Y : 


Use of Bulbs No. of Bulbs 


Cn hours) Company (X) (Company Y) 


Which company is bulb have 


(i) Higher average life 
(ii) Greater consistency ? 


[B.C.A. (AMU) 2008, 2004; B.C.A. (MDU) Rohtak 2008; B.C.A. (Avadh) 2007, 2004] 
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Solution: 


Calculation of X and C.V (the coefficient of variance) 


Use of Bulbs| M.V. | A=350 Company X Company Y 


100-200 


ER 
ONAE 
a a EE 
TAC EAEE E 


Xfd 


Company X Company Y 


E Toe 
100 ae 


=350+0.25 | =350-0.25 025 


Since the arithmetic mean is higher for company X. Hence bulbs of company X 
hence higher average life. 


Company X Company Y 


(ii) 


= 41.25 -0.0625 y=yl. 35 -0.0625 


Ox =1.135 oy =1.135 
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C.Vy = ČX x100 = — 2 x100 =0.324% 
X 25 


=0.325% 


x — 
Y 349.75 


Since the coefficient of variation is less for company X, hence company X’s 
bulbs have greater consistency. 


Example 40: From the prices of shares of X and Y given below, state which share is more stable in 


value. 


[or [oe [oe [om on Poe [oe [on [oe [os 
[B.C.A. (Kurushetra) 2011, 2006] 


Solution: 


Go) 


Here, 


Coefficient of variation of X = 2x. x100 = 
X 44 


Standard deviation of Y-series, oy = 


Elements of Statistics 


N =10, Zdr =2, Idx? =106 


2 
Oy = m6 — =,/10.6-0.04 =3.25 
10 10 
25 
2 x100 =7.25 
8 
ay? _ (Ear 
N N 


2 
ey (a fe) = /8.5—.25 = 8.25 =2.87 
10 UO 


Coefficient of Variation of Y = + x100 


2.87 
945 


x100 = 304 


Since the C. V is Y data is less than C.V of X data, so shares of Y-series are more stable in 


value. 


Example 41: 


(i) Coefficient of variation of two series are 50% and 69%. Their standard deviations are 21.2 
and 15.6 respectively. What are their arithmetic means ? 


(ii) Mean of 100 items is 58 and their standard deviation is 4 fined the sum of squares of all 


items. 


Solution: 


G@. C.V=£x100 
X 
for series A: we have 


for series B: we have 


2 
(ii) o= =x" = (x)? or 
N 
= 
= 


or 


[B.C.A. (Delhi) 2011; B.C.A (AMU) 2011, 2007; 
B.C.A. (Kurukshetra) 2010; B.C.A. (Lucknow) 2008, 2004] 


ea eas ey =365.6 
Xa 
15; = 

69 = 2-7 100 = Xq =22.6 
XB 
x2 

o = -(X? 
N 
Ix? 2 

4)* =—__(50 

Gy =F59g 70) 


1600 + 250000 = £X? 


IX? =251600 


wz N 
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Example 42: Mean and standard deviation of the following continuous series are 31 and 15.94 
respectively. 


The distribution after step deviation is as follows: 


Determine the actual class intervals. [B.C.A. (Delhi) 2011; B.C.A. (Bundelkhand) 2008; 
B.C.A. (Kanpur) 2007; B.C.A. (Agra) 2004] 


Solution: 


ee ame 


2 
> 15.94 = 214 (5 xi 
101 UOI 
> 15.94 =42.713 - (0.376) xi 
> 15.94=,/2.713-—.141xi or 15.94 =1.60xi 
15.94 
> i= %24 =9.96 =10 (Approx) 
Also LAr y 
N 
31=A+(SS) x10 
101 
> 31=A-3.76 
> A=34.76 or A=35 (Approx) 


Hence, A=Assume mean = 35 and class interval (i) = 10 


2 10. b Elements of Statistics 


Example 43: A collar manufactures is considering the production of new style of callar to attract 
young man. The following statistics of neck circumference are available based upon measurement of a 


typical group of college students: 


ial vars [25] 0 [95] wo] ws] o a 


Nirgsuaoe [|v folwl>to fi fr 


Compute standard deviation and use criterian (X +30) to determine the largest and smallest size of 
collar he should make in order to meet the needs of practically all the customers bearing in mind that 
collar are worn, on an average 3/4 inch longer that the necks size. [B.C.A. (Meerut) 2004] 


a. 


Solution: Let A=145 & d= i=0.5 


Now X=A+ af y =14. 54 C28 a x0.5 =14.5 -0.268=14.2 inches 
zZ le (2) 544 (-124/ 
— x0.5 =.719 inches. 
231 — 


Using the criterion X+30 
Smallest size of collar = X -30+ > inch 
=14.232 -3 (0.719)+0.75 


=14.232 -2.157 +0.75 
=12.825 inches 
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Largest size of the collar = X+304+ - inch 


=14.232+3(0.719)+0.75 
=14 .232+2.157+0.75 
=17.139 inches. 


Example 44: Following is the data related to two factories: 


Average wage per hour 
Variance 


Find the following: 


(i) | Which factory pays larger amount as total wages per hour. 

(ii) | Which factory is more variable ? 

(iii) Find combined S.D. [B.C.A. (Garhwal) 2011] 
Solution: 


(i) | The total wages can be obtained by finding the value of ZX of both factories 


EX, =%3,000 EX, =%3,000 


Both factories are paying equal amount as total wage per hour 


(ii) | Variability can be obtained by finding the value of C. V. 


©] 
Cy = O0 CV, = 22 x100 
X Y 


= + a CV, =2-x100 =25% 
15 12 


Factory A is more variable at its C. V. than that of C. V. of factory B 
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(iii) Combine standard deviation = ©] 2 


Njo? + No04 + Ny d + No d2 
O12 am eua 
Ni + No 


Fiv _N,X1+NoX2 _ (200x15)+(250 x12) 
Ni +No 200 +250 


_ 3000 + 3000 _ 6000 
450 ~ 450. 


Where d, = X| - X12 =15 -13.3 =1.7 
dy = X2 - X12 =12-13.3=-1.3 


— =13.3 


2000 x16 + 250 x9 + (200 x(1.7)?) +250 x(-1.3)" 


oO = 

12 200 +250 
3200 +2250 +578+422.5 

l% 450 
64505 _ 4.33 =3.78 
450 


Example 45: Calculate Karl Pearson’s Coefficient of skewness from the following data: 


mame [> [el e e e e e e 


smn [ola e e e e e e 


[B.C.A. (Rohilkhand) 2007, 2005] 


Solution: The above data are in cumulative frequency form. Firstly, we convert these 
data into simple form: 


A=45 = 
Marks M.V.(m) fa 
ee eee 
[oa | 10 | 160 

oe ee a 
a0-30/ 20 | 25 | -20 | -2 | 40 | 8 | 70 | 


ws) o | s | o fo] o | o | «| 
aek pa ee E 


| 70-80 | JE 


oe] E 
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As this is a binomial series (i.e. there are two maximum frequencies), we will find 


coefficient of skewness by using the formula 
Coefficient of Sx = eM) 
o 


= E fd’ 
X=A+ T xi 


= (88) gas -2$ 239.27 


=45+ 


Median item = Size of Aih item = Size of th item=75th item 


Median lies in the class interval 40 —50 


ees 
M=Ļ4+2— xi 
f 
N j 
Here, L ZAO Sre J= 70; f E10; i=10 
75-70 


M =40 +—— x10 =45 
10 


2 
830 (-86 x10 
150 (150 


= 0.53 -0.33 x10 =J5 .20 x10 =2.28x10 =22.8 


X =39.27;M =45:0 =22.8 


3(X-M) _3(89.27-45) _3(-5.73) _-17.19 __) 2. 


Coefficient of Sx = 728 728 378 
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List of Formulae 


Range 


Individual Series Discrete and Continuous Series 


Range = L-5 Range=L-S 
-S L-S 


Coeff. of Range = cat) Coeff. of Range = —— 


+S L+S 

Here, Lis the upper limit of the highest 
class and S the lower limit of the lowest 
class. 


Quartile Deviation (Q.D.) 


Individual Series Discrete and Continuous Series 
meee oee Sa 
2. 2 
Coeff. of Q.D. = SO) Coeff. of Q.D. = Ome 
Q+Q Q+Q 


Mean Deviation (M.D.) 


Individual Series Discrete and Continuous Series 


(i) Actual Mean Method (i) Actual Mean Method: 
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(ii) Short-Cut Method: (ii) Short-Cut Method: 


EE -EE 


where d=(X-—A) where, d =(X — A) 


(iii) Step-Deviation Method: (iii) Step-Deviation Method: 


E Ae (xfa? 
m 


N N 


2 BN ce 2 9 
Cone le D eu wt Reet 


N, + No 


where, D, =(X, — Xj) and Dy =(X> - X12) 


Variance = 0° = (Sab? ie 


Percentage of Items Lying between Specific o Times 


(i) within X + lo or one o limits 68.27% 


(ii) within X + 26 or two o limits 95.45% 
(iii) within X + 36 or three o limits 99.73% 


(2 16 p Elements of Statistics 


Problem Set 


From the following data, find range and quartile deviation. Also determine their 


coefficients: 


m e CR Ce e 


uso [n elele e eee 
Calculate Q.D. and its coefficient from the following data: 

wm | o|ļa]ele]s |o] 
Sarwo | 1 |o |o | oe O ]r 


Also find range and its coefficient. 


Calculate range and Q.D. for the following data. Also find their coefficients. 


10-15 | 15-20 | 20-25 | 25-30 | 30-35 | 35-40 
coos ZEEE EES 


[B.C.A. (Agra) 2004] 


Calculate mean deviation from median of the prices given below : 


pass] 20 [20 95 [205 [205 [25 [ovo [2 [am [2 


Also find coefficient of mean deviation. 


From the table given below, find mean deviation from the median and also its 
coefficient. 


10-20 | 20-30 | 30-40 | 40-50 | 50-60 | 60-70 
reves [e[ufs[ul7[s_ 


Compare the dispersion in ages of boys and girls by using coefficient of mean 


deviation. 


jew lotta bs belo [s [2 [ 


[B.C.A. (Delhi) 2005] 


Using short-cut method and step-deviation method, obtain standard deviation of 
the following data: 


X :420, 440, 400, 420, 470, 480, 440, 480, 450, 500 


OON 
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a 


10. 


l1. 


12. 


13. 


Ten students in a class have obtained the following marks in statistics. Find the 
S.D. of marks. 


[Hint. Use short-cut method] 


Compute arithmetic mean and standard deviation from the following data : 


Also find the coefficient of variation. 


Find S.D. and C.V. from the following data: 


[B.C.A. (Meerut) 2007] 


Define S.D. Explain its features. 
[B.C.A. (Meerut) 2004] 


Define the term dispersion. Also differentiate between absolute and relative 


measures of dispersion. 
P [B.C.A. (Meerut) 2007] 
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Answers 


Ea Range=12 ; Q.D.=3.5 ; C.R.=0.07 ; C.Q.D.=0.04 

Es Range=20 ; Q.D.=1.5 kg ; C.R.=0.43 ; C.Q.D.=0.024 

ES Range=35 ; Q.D.=6.458 ; C.R. = 0.77, C.Q.D. = 0.33 
M.D.=12.74 ; C.M.D.=0.367 


Age dispersion in the boy is greater 


310), 333) 


X =4; o =1607 ; C.V.= 40.17% 


o =15.94;C.V.=51.42% 


¥=4059 ; 0° =22187 


ce S 


uun 


Chapter 


Permutations and 
Combinations 


6.1 Factorial Notation 


Let n be a positive integer. Then, the continued product of first n natural numbers is 
called factorial n and is denoted by n! or |n. 


Therefore, n!=n(n-])(n-2)....3.2.1 
or n!=1.2.3....(n-l)n 
Illustration: 6!=6x5x4x3 x2 xl=720 


5!=5x4x3 x2 x1=120 
4!=4x3 x2 xl=24 


Note: When nis negative or a fraction, n!is not defined. 


Theorem 1: 

To prove that Ol=1 

Proof: Since [n=n|n-1 

Therefore, put n=], we find 
U=1x[0 

or l0 = T =] 


or |0 =1 
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\ Solved Examples ¥ 


Examplel: Compute. 


> jd iii 
(i) ax (ii) [28 (iii) L.C.M. [l4, 15,16] 
(iv) B2 (v) J,4,1 
[47 |5 5 16 IZ 
Solution: 


l8 _8x7x6x5x4x3x2xIl_ 
l4x|3 (4x3x2xI)x@Bx2xl 


[30 30x29 x |28 
[28 [28 


(iii) We have 


=30 x29 =870 


|4=4x3x2xl=2? x3 
[5=5x4x3x2xl=29 x3x5 


16=6x5x4x3x2xl=24 x3? x5 
L.C.M. [l4, |5, [6 ]=24 x32 x5 =720 


(iv) 82 _ 52 51x50 x49 x48 x 147 5g stig x49 x2 =25,98,960 
|47 [5 |47x5x4x3x2xl 


l l loal l 1 42+7+1 50 50 


v) —+—+—=— + + = = = 
Ib l6 |Z L 6|[5 7x6x|5 7x6x|[5 7x6xl5 |Z 


Example 2: Convert into factorials. 


(i) 6.7.8.9 ii) 2.4.6.8.12 
(iii) 1.3.5.7.9.11 (iv) 18.20.22.26.28 
Solution: 
i 67 equ 68? 22 
[5 I5 
Gi) 2-4-6-8-10 =2° x(1-2:-3-4-5)=2 x|5 
(iit) 1-3-5-7.9-11 2 2343 6-7:8-9-1011 _ hel = u 
2-4-6-8-10 2? 0.2345) X? |5 


(iv) 18-20-22-24-26-28-=26 (9-10-11-12-13-14) 
2 [8-9-10-11-12-13-14 _ 46 [4 
[8 


loo 
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Example 3: Find x, if are ae ee ee 
4 B lbg 
Solution: We have, Lolilo 
l4 [5 le E 
l 1 l x 
=> — + + —— = — 
l4 5xļ4 6x5x|[4 7x6x5x|4 
30+6+1 x 
=> —— —— 
6x5x|4 7x6x5~x|4 
=> 7xX37=x 2>x=259. 


Example 4: Find n, if 


l l n 
i n+l=12xjn-l1 ii) — + — =— 
O | et ae in 
Solution: 
(i) [z+1=12xjn-1 
> (n+ I)nx|n-l=12x|n-1 
> (n+l) n=12 
= w +n-12=0 
> (n+ 4) (n—3) =0 
> n=3 (~ n cannot be negative) 
e l l n 
(ii) — + — =— 
L w WW 
1 l n 
> — + —— = — 
19 10x|9 11x10x]|9 
l n 
> 1+ — =—— 
10 11x10 
=> n=121 


Example 5: If |n+2=1,560 |n, find n. 


Solution: We have, "n+ 2 =1,560 |n 


(n +2) (n +1) [n=1,560 |n 
(n+ 2) (n+ 1) =1,560 
(n+ 2) (n+ I) =1,560 
£ +3n+2 -1560 =0 


E E E | 


1 +3n-1,558=0 
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—3+ V9 —4 (1) (-1,558) 


> n= 
2 
-3 + 6,241 -3 Ł 
=> n= z = o =38,-41 


Since n can not be negative so, n = 38 


Example 6: Prove that Be 1-3-5 .... (2n - 1) 2”. 
n 


Solution: We have, 
[2n _1-2:3-4-5 ....(2n-—1) 2n 


ln ln 
_(1:3-5....(2n-1)] [2-4-6....2n] 
7 Ln 
_(1-3-5....2n=1) (2 x1) - (2 x2)-(2 x3)....(2 xn) 
E ln 
_1:3-5....(2n=1) 2” (1-2-3-4-5 ....n) 
p ln 
_13-5....(2n-1) 2” |n 
7 ln 


=1-3 Dire (2n-1) 2”. 
Example 7: y and „l. are in the ratio 2 : l, find the value of n. 
[2 |n-2 [4 |n-4 


Solution: We have, 


Ln . Ln =?2:] 
Biga Ee 


i 4jn-4 
m ln B42 
[2 jn-2 [z 1 
4x3 |n-4 
(n—2)(n-3)|n-4 1 
> (n—2) (n-3) =6 
> we —5n+6=6 
=> 1 —5n=0 
> n(n—5)=0 


=> either n=O or n=5 
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But for 1 =0, |n—2 and |n — 4 are meaningless. 
Hence, the required value of nis 5. 


Example 8: Prove that |n(n+2)=|n+ |n+1 


Solution: R.H.S.=|n+jn+1 


=|n+(n+1)|n 
=|a(n+2)=L.H.S. 


\ Exercise 
1. Compute (i) l2 (ii) l2 -h0 (iii) „le, 
[29 i2 [13 13 
2. Find L.C.M. [[3, L6, [8]. 
3. pB u 4t a 


[3 }2n-3 |2|n-2 
4. Find x if 


5. Find n, if 


(i) Jjn+2=60x |n-1 (ii) Jn+2=2550x|n 
6. Prove that, 


(i) Bono- (n-2)....(r +1) 


(ii) Gop eae [u 


[tort [ner 


(ii) |n [a [ut l 


lkp 1” a r+l [ejn-r+l 


l and le. 
2 \(n-2) l4jn-4 


are in the ratio 2 : l, find n. 


R225 
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Answers 


EA (i) 29,760 1,310 (iii) 560 


Pfs Epee 


6.2 Permutation 


Permutation: The different arrangement which can be made out of a given number of 
things by taking some or all at a time, are called Permutations. 


For example, consider arranging the digits 1, 2 and 3. The possible arrangements as 
follows: 


123, 132, 231, 213, 312, 321. 


These are 6 permutations. Here the same three digits 1, 2 and 3 have been used but the 
number changes when the order of digits is changed. 


Thus, forming numbers with given digits means arranging the digits and hence it is the 
problem of permutation. 


Notations: Let r and n be positive integers such that 


l<r <n. 


Then, the number of all permutations of n things taking r at a time is denoted by 


Pinn or "P 


Theorem 2: Let 1 <r <n. Then the number of all permutations of n dissimilar things 


taken r at a time is given by 


"P. =n(n-1)(n-2)....(n-r+1)= Lu 


n-r 


Proof: We have 


L.H.S. = *P, =n(n—-1)(n-2).....(n—-r +1) 


n(n—l)(n—-2)...(a—r +1) |["-7 
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Theorem 3: The number of all permutations of different things taken all at a time is 
given by "P, = |n. 


Proof: We have, 


Np = Ln 
; |u—r 
Put r =n, we find 
ap _ lt _le_ln_), 
n-n |0 1 
Theorem 4: Prove that |0 =1. 
Proof: We have, 
AP = lz [put =r] 
n-r 
n 
=> "P, = lo ("Pa = Ln) 
=> [n= la 
LO 
=> |0 = lz = 
[z 
Example 9: Evaluate the following: 
@ SR (ii) OP, 
(iii) 8Ps (iv)  30pP, 
Solution: 
(i) 2p, = [12 — [2 _12 x11 x10 x9 x18 _ 1 989 
[12-4 [8 [8 
(ii) 6p, _ l6 _l6_6x>5 x4x3x2 -360 
6-4 |2 2 
or 6P, =6 x5 x4x3 =360 


(iii) 8p, =|8=8x7x6x5x4x3 x2 x1=40320 


30p, -_130_ _ 130 _30x29 x [28 570 


(iw 30-2 [28 [28 
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Example 10: If "Py =20 x "Po, find n. 


Solution: We have, "Py =20 x "Py 


> n(n—I) (n-2) (n-3) =20 n(n-1) 


=> (n—2) (n-3) =20 
> w -5n-14=0 

> (n—7) (n+ 2) =0 

> n=7 (wn #—2) 


Example 11: If 20 p, = 6,840, find r. 


Solution: We have, 20p, =6,840 


> LO -egi 
20 -r 

_ [20 _ 20x19 x18x [17 
[20-r [17 

. 2o _ [20 
[20=r° U7 

> 20-r=17 or r=3 

ll(n-1 
Example 12: "*°P,.) = ae x "t3 p find n. 


Solution: We have, 


CERP ii = 5 x "+3)p, 

- [+5 _U(n-)) [1+3 

|(n+5)-(n+1) 2 [443 -n 

(n+5)(n+4)|n+3 11(n-l)_ |”+3 

=> ae > _=EN MM 
4x|3 2 13 

> (n+ 5) (n+ 4) =22 (n-1) 

= iw +9n+20 =22n-22 
> i? -13n+42 =0 
> (n—6) (n-7) =0 


> n=6,7. 
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Example 13: If "P, =2 x>P3,find n. 


Solution: "P, =2 x `P 


> n(n-l)(n-2)(n-3)=2x5x4x3 
> n(n—1) (n-—2) (n—3) =120 

5 OÊ —3n) OÊ -3n + 2)=120 

> m (m + 2) =120 where m= -3n 

> m +2m-120=0 

> (m+12)(m-10)=0 

=> m=-12, or m=10 

=> Ê —3n=-12 or ê -3n=10 

=> Ê —3n+12=0 or f -3n-10=0 


3 + J9 -48 
= ee =O 


=> n 
3 +i 
=> a or n=5 or n=-2 
2 
> n=5 [neglecting negative and imaginary value] 


Example 14: Find nif °P; +5-?P, ='°P, 


n: 


Solution: We have, ? P 45. oR, = 10p, 


= l? „5. l2 H0 
p25 poa oaa 

=> B; B- uo 

l4 Is [l0-n 
, 2,12 bo 

l4 [4 lO-n 
=> 12 _ 10x12 

l4 100-7 


> 5 x|4=|10-n 
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> [10 -n=|5 
=> 10 -n=5 
> n=5. 


Example 15: Prove that 
"p, = n-lp, Jre -Dp 


Solution: R.H.S.=”7DP. +r. "DP. 


Sipil n-rt+r 
a (n-r)jn-r-1 
nnz 


le ala ="P.=L.H.S. 
(n=r)jn-r-l [n-r 


Example 16: Prove that 


(i) "P, = "Fini (ii) "Pp. =n- a ae 
Solution: 
n n 
(i) É n-l s= ==|n=” n 
n-(n-l) U 
- np la ae njn-l | (a-1) 
ü) "P s- -p -Dp 
n-r (n-l)-(r-]) 
56 p 54 p 


Example 17: If "*® "+3 =30,800:1, find r. 


Solution: We have 


and 54p , == s 
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56 /|50 -r 
56p _.54p B6 /p0-r _ 30,800 


r+6: r+3 ~“ p4/6l-r I 
> 56x55 x(51-r) =30,800 
> 5l-r=10 >r=4l 


Example 18: Prove that 


Sp. =8. 1P 
Solution: We have 
8 8 
L.H.S. = 8P; s E 
-> B 
8x7x6x5 x4 x| 
= = =6,720 
3 
7 
R.H.S Eo E 
7-4 
7x6x5 x4x[8 
= x—-———— 
B 
=6,720 
=> L.H.S.=R.HLS. 


Example 19: If III p,, =! 10D, +x! Upa then find the value of x. 
[B.C.A. (Agra) 2008] 
Solution: Here, we have 
1p), Op), 4 x, O (1) 
We know that 
mp ="Ip +r, "1P (2) 


Comparing (1) and (2), we get 
x=r=93 


Example 20: If "Poo =103. 102 Pie then find the value of n. 


Solution: Here, we have 
"P oo = 103.1? Pog (1) 
We know that 
"P. =n. "lP (2) 
comparing (1) and (2), we get 
n=103 
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Example 21: If *P; =5 x4x3x2xlLfind x. 
[B.C.A. (A.M.U.) 2010] 


Solution: Here, we have 


*P5 =5x4x3x2xl oil) 
We know that 

"P, =|n=nx(n-l)x...x3 x2 x1 (2) 
From (2), we have 

SPs =|5=5x4x3x2xl (3) 


Comparing (1) and (3), we get 
x=5 


Example 22: Each of 2 women and 3 men is to occupy one chair out of 8 chairs, each of which is 
numbered from 1 to 8. 


First, women are to occupy any two chairs from those numbered 1 to 4; and then the three men would 
occupy any three chairs out of the remaining 6 chairs. What is the maximum number of different 
ways in which this can be done ? 

[B.C.A. (Lucknow) 2010] 


Solution: First women can occupy one chair numbered from | to 4 in *p =4 ways. 
Second women can occupy one chair out of the remaining 3 chairs (numbered | to 4) in 
a P =3 ways. 

Ways of occupying chair by women's =4 x3 =12 


Ways of occupying the 6 chairs by 3 men's (out of the remaining 6 chairs) = 6P =120 
The maximum number of different ways of occupying chair = 12 x120 =1440 Ans. 


Example 23: A question paper had 10 questions. Each question could only be answered as True 
(T) or False (F). Each candidate answered all the questions. Yet, no two candidates wrotes the 


answers in an identical sequence. 


How many different sequences of answers are possible ? 
[B.C.A. (B.H.U.) 2010] 


Solution: Here the questions are non-repeatable whereas the answers (T or F) are 


repeatable. 


No. of different sequences of answers =2!0 ~1094 
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Example 24: A person X has four notes of rupee (%) 1, 2, 5 and 10 denomination. The number of 


different sums of money she can form them is ? 
[B.C.A. (Kanpur) 2010] 


Solution: The number of different sums 


Example 25: In how many ways can four children be made to stand in a line such that two of them, 
A and B are always together ? [B.C.A. (Kanpur) 2007] 


Solution: Let A, B, C and D are four children out of which AB is considered as one child 
thus we have three child which can be arranged in a line in 3 P; = 3 = 6 ways and the pair 


AB itself can be arranged in 2p, =|2 =2 ways. 


The required no. of ways =6 x2 =12 


| Exercise 


Į: Prove that ? Ps +3. P, = O., 
2. Prove that "P, =2 - "Py, _2). 
3. Find all permutations of 7 objects taken 3 at a time. 
4. (i) If"Py: "Ps =1:2,findn 
Gi) If "Dp, :”+DP, =5 :12, find n. 


(ii) If2"+1p _) :@"-Yp, =3:5, find n. 


n-1 

5. Ifllp. =!?p_,, findr. 

6. (i) If?"P3 =100 x "P, find n. 
(ii) If16 "Ps =13"*!P,, find n. 


(iii) If ”P; =20 x “P3, find n. 


(ii) 8 (iii) 4. 


(ii) 15 (iii) 8 
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6.2.1 Permutation with Repetition 
If out of n objects in a set, p objects are exactly alike of one kind, q objects exactly alike of 
second kind and r objects exactly alike of third kind and remaining objects are all 


different then the number of permutation of n objects taken all at a time is 


-la 
le L le 


Theorem 5: The number of permutations of n different objects taken r at a time when 


each object may be replaced any number of times in each arrangement is n. 


Proof : The number of permutations of n objects taken r at a time is the same as the 


number of ways of filling r places with n different objects. 


lst 2nd 3rd rth Place 


No. of ways 


Number of ways of filling first place = n, 

Number of ways of filling second place = n, 

(Since the object used in filling the first place can be repeated) 
Number of way of filling third place = n, 


Number of way of filling rth place =n 


Therefore, by the fundamental principle of counting, the required number of 
permutations 


=n-n-n.....r (times) =n" 
Example 26: How many ways are there to arrange the nine letters in the word ALLAHABAD. 


Solution: Since the word ALLAHABAD contains 4A’s and 2L’s, therefore, the total 
number of arrangement of nine letters in the word ALLAHABAD 


=—= =7560 ways. 
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Example 27: Find the number of different messages that can be represented by sequences of four 
dashes and three dots. 


Solution: The number of different message sent by 4 deshes and 3 dots 


oil gs 
413 


Example 28: Find the number of ways to point 12 offices so that 3 of them are green, 2 of them 
pink, 2 of them yellow and the remaining one white. 


Solution: The required number of ways 


-— -166,320 
l3 L L2 L 


Example 29: The letters of the word “TUESDAY” are arranged in a line each arrangement 
ending with letter S. How many different arrangements are possible ? How many of them start with 
letter D ? 


Solution: There are seven different letter in the word “TUESDAY” 


lst 2nd 3rd 4th 5th 6th 7th 


when last place is filled with letter S then remaining 6 places can be filled by 6 letters in 


Op. = 6! ways =720 ways 


Now, when the letter ‘D’ is also fixed, then remaining 5 places can be filled by 5 letters in 


> Pe =5! ways =120 ways. 


Example 30: How many signals can be made by using 6 flags of different colours when any number 
of them may be hoisted at a time ? 


Solution: Number of signals that can be made using | flag 
= °P, =6 


Number of signals that can be made using 2 flags 
! 
6 y= 6! 
(6-2)! 


=6x5=30 
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Number of signals that can be made using 3 flags 


! 
(6-3)! 


Number of signals that can be made using 4 flags 


6 6! 
=" P} =——— =6 x5 x4 x3 =360 
(6-4)! 


Number of signals that can be made using 5 flags 


! 
=op a"! 6 x5 x4x3 x2=720 
(6-5)! 


Number of signals that can be made using 6 flags 
=P; =6!=720 
Therefore the total signals that can be made 
=6 + 30 +120 + 360 + 720 + 720 =1,956 


Example 31: Find the number of 4 digit numbers that can be formed using the digits 1, 2, 3, 4 
and 5 if no digit is used more than once in a number. How many of these number will be even ? 


Solution: 


4 places can be filled by 5-digits in `P ways 


5! 


Jl 
= 6-4! ways = Ti ways = 120 ways 


2 ways 
For even numbers, unit’s place can be filled in 2 ways 
(either digit 2 or digit 4) 


If units place is filled by any one digit, then remaining 3 places can be filled by 
4! 


(4-3)! 


remaining 4 digit in tP = 


=4!=24 ways 


Hence, total number of ways =2 x 24 =48 ways. 
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Example 32: How many 4 letter words, with or without meaning, can be formed out of the letters 
of the word ‘LOGARITHMS:’ if repetition of letters is not allowed ? 


Solution: The word ‘LOGARITHMS’ contains different letters 
the number of required words 
=number of arrangements of 10 letters, taken 4 at a time 
— Pp, 
=10 x9 x8x7=5,040 


Example 33: It is required to seat 5 men and 4 women in a row so that the women occupy the even 


places. How many such arrangement are possible ? 
Solution: In a row of 9 positions, the 2nd, 4th, 6th and 8th are the even places. 
These 4 places can be occupied by 4 women in 
4P, =|4 =24 ways 
The remaining positions can be occupied by 5 men in 
` P, = [5 =120 ways 
. the total number of seating arrangements =24 x120 = 2,880. 


Example 34: How many words, with or without meaning, can be founded by using all letters of 
the word ‘DELHI’, using each letter exactly once ? 


Solution: The word ‘DELHI? contains 5 different letters. 
*. the number of required words 
=number of arrangements of 5 letters taken all at a time 
=°Ps =|5=5x4x3x2=120. 
Example 35: In an examination hall, there are four rows of chairs. Each row has 8 chairs, one 
behind the other. There are two classes appearing for the examination with 16 students in each class. 


It is desired that in each row all students belong to the same class and that no two adjacent rows are 
allotted to the same class. In how many way can these 32 students be seated ? 


Solution: There are 2 ways of selecting the rows for a class i.e., either I and III or II and 
IV. 


Now, 16 students of one class can be arranged in 16 chairs in 
16 P 6 = |16 ways 

and 16 students of another class can be arranged in 16 remaining chairs in 
16 P 6 = |L6 ways 


' the required number of ways =[2 x |16 x [16 ] 
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Example 36: How many odd numbers greater than 80,000 can be formed using the digits 2, 3, 
4, 5 and 8. If each digit is used only once in a number ? 


Solution: Ten 
Thousand Thousand Hundreds Ten's Unit's 


lway 2 way 
Since, it is an odd number greater than 80,000, therefore unit’s place can be filled by 2 
ways (either by digit 3 or by digit 5) and ten thousand’s can be filled by only 1 digit i. e.,8. 
Also other 3 places can be filled by remaining 3 digits in 
3 P =3!ways 
Total number of ways =2 x1x3!=12 (By F.P.C.). 


Example 37: How many four-digit numbers (between 1,000 and 10,000) are there with distinct 
2 of these, how many are odd numbers ? 


Solution: 


(i) Total number of arrangements of ten digits 0, 1, 2,3, 4,5, 6, 7, 8 and 9 taking 4 at 


a time is 10, But these arrangements, also include those numbers which has 0 at 


thousand’s place. Such numbers are not the four digit numbers. So, we have to 
exclude those three digit numbers when 0 is in thousand’s place, we have to 
arrange remaining 9 digits by taking 3 digits at a time. The number of such 
arrangements is 9 Py. 


Hence, the total number of four digit numbers 
=10P, -°P =5,040 -504 =4,536. 


(ii) Now, since we want the odd number, therefore unit’s place can be filled in 


5 (1 or 3 or 5 or 7 or 9) ways. 


9 
P3 ways S ways] ways Sp. ways 5 ways 


The required number of odd numbers = number of all 4 digited number (including 
0) having an odd digit in thousand’s place — number of these 4 digited odd 
numbers which have 0 in thousand’s place 


=5 x °P -1x P) x5 =5 x9 x8x7-8x7x5 =2,240. 
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Example 38: Find the number of permutations of n different things taken r at a time each of which 
includes 3 particular things ? 


Solution: Given r places, 3 places by particular things can be filled in "P3 ways. 


Now, we have to fill up the remaining (r — 3) places with the remaining (n — 3) things in 


n=3P 3 ways. 


Total number of permutations = ”P} x n-3p, _3. 


Example 39: Find the number of permutations of n different things taken r at a time from which 
three particular things are to be excluded. 


Solution: We exclude the 3 particular things from the total number of things and form 
permutations of the remaining (n — 3) things taken r at a time. 
Therefore, the required number of permutations 
= n zap, 

Example 40: How many permutation can be made out of the letters of the word ‘TRIANGLE’ 
How many of these will begin with T and end with E ? 

Solution: The word TRIANGLE contains 8 different letters. 
The number of all permutation made out of 8 letters 


= 8p, =|8=8x7x6x5x4x3 x2 x1=40,320. 


If the word starts with the letter T and ends with letter E then, the number of ways to 
form the words =8 P% 


-18 __l8 
8-6 l2 
=20,160 


Example 41: In how many ways can 10 books be arranged on a shelf so that a particular pair of 
books shell be (i) always together (ii) never together ? 


Solution: 


(i) Since a particular pair of books is always together, let us tie these two books 
together and consider the pair as one book. Then, we shall have to arrange 9 books 
on the shelf. 


This can be done ? Py = |9 ways 


Now, in each of these arrangements, the two books comprising the particular pair 


can be arranged among themselves in |2 i.e., 2 ways. 


n the required number of ways =2 x (|9) 
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(ii) we know that 10 books can be arranged on the shelf in 
Pio = [10 ways 


Also, the number of ways of arranging 10 books so that a particular pair is always 
together =2 x (|9) 


the number of ways of arranging 10 books so that a particular pair is never 


together =(|10 -2 |9) 
=(10x|9 -2 x |9) 
=(8 x |9) 


Example 42: There are 6 English, 4 Sanskrit and 5 Hindi books. In how many ways can they be 
arranged on a shelf so as to keep all the books of the same language together ? 


Solution: Let us make one packet for each of the books on the same language. 
Now, 3 packets can be arranged in 3 P = |3 =6 ways 

6 books on English can be arranged in |6 =720 ways 

4 books on Sanskrit can be arranged in |4 =24 ways 

5 books on Hindi can be arranged in |5 =120 ways 

"n the required number of ways = 6 x 720 x24 x120. 


Example 43: In how many ways 9 pictures can be hung of 6 picture nails on a wall (in a row) ? 


Solution: Here, it is obvious that only 6 pictures out of 9 will be hung and 3 will be left 
out. The number of ways in which pictures can be hung on 6 picture nails on a wall is the 
same as the number of arrangements of 9 things, taking 6 at a time. 


Hence, the required number = P, 


=9x8x7x6x5 x4 = 60,480. 


Example 44: A tea party is arranged for 16 people along the two sides of a long table with 8 chairs 
on each side. Four men wish to sit on one particular side and two on the other side. In how many ways 
can they be seated ? 


Solution: The order of seating arrangement is as below: 


MıMəM;M4 


YAY) A A O OD 0O 
PETEPTLETEETT ETE EEE TLE EEE TTD 
Sy, u,b BT By ky Bay 


II 


(i) 4 persons (M,, My, M3, M4) are to sit one-one side in any one side in any of 8 
chairs in 8p, ways. 
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(ii) 2 persons (M and M3 ) wist to sit at other side on any of the 8 chairs. This can be in 
8 P, ways. 
Gii) Rest 10 persons can sit on any of the remaining 10 chairs in 10 chairs in 10 Po 
ways. 
Total number of ways of sitting 
= 8P, x 8p, x sea (by using F.P.C.). 
Example 45: In how many ways can 6 balls of different colours, namely, white, black, blue, red, 


green and yellow be arranged in a row in such a ways that the white and black balls are never together 
2 


Solution: Let x be the number of all arrangements, each one of which contains white 
and black balls together. 


And, let y be the number of arrangements, none of which contains white and black balls 
together. 


Then, V+ y= EP, =|6 =720. 
Now, considering a white and a black ball tied together as one ball. 
The number of ways of arranging the 5 balls 
=P; =|5 =120. 
Also, these 2 balls may be arranged among themselves in [2 =2 ways. 


x =2 x120 =240 
So, y =720 — 240 = 480 


Hence, the requisite number of ways = 480. 


Example 46: In how many ways can the word ‘PENCIL’ be arranged so that N is always next to 
E? 


Solution: Keeping E and N together and considering then as one letter, we have to 
arrange 5 letters at 5 places. This can be done in 


> Ps = [5 =120 ways. 


Example 47: The principal wants to arrange 5 students on the platform such that the boy Salim 
occupies the second position and such that the girl Sita is always adjacent to the girl Rita. How many 
such arrangements are possible ? 


Solution: Let the seats be arranged as shown below: 
OS O00 0 
I I WIV V 

Keep Salim fixed at the second position. 


Since, Sita and Rita are to sit together, none of the two can occupy the first seat. 
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This seat can be occupied by any of the remaining two students in 2 ways. 
Now, two seats, namely, III , IV, V, may be occupied by Sita and Rita in 4 ways. 
The remaining seat may now be occupied by the Sth student in one way only. 
.. The number of required arrangement =2 x4 x1=8. 
Example 48: Find the number of ways in which 5 boys and 3 girls can be seated in a row so that 
two girls are together. 
Solution: The 5 boys may occupy 5 places in 
> ps =|5 =120 ways. 
Since no. two girls are to sit together, we may arrange the 3 girls at the 6 places. 
Le: XBXBXBXBXB 
Now, the 3 girls at 6 places may be seated in 
© Py =6 x5 x4 =120 ways. 
<. The required number of ways = 120 x 120 =1400. 
Example 49: How many permutations of the letters of the word ‘APPLE’ are there ? 
Solution: Here there are 5 objects, two of which are of same kind and the others are 
each of its own kind. 


The required number of permutations 


s— bb gð 
l2xUxUxU 


Example 50: (i) Find how many arrangements can be made with the letters of the word 
‘MATHEMATICS’. 


(ii) In how many of them are the vowels together ? 


Solution: 
(i) There are 11 letters in the word 
‘MATHEMATICS’ 
Out of these letters M occurs twice. A occurs twice, T occurs twice and the rest are 


all different. 


Required number of arrangement = =e = 4989600. 
|2 x|2 x |2 


(ii) The given word consists of 4 vowels namely A, E, A and I. Treating these four 
vowels as one letter, we have to arrange 8 letters 


(MTHMTCS) + (AEAI); out of which M occurs twice, T occurs twice and the rest 
are all different. 
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.. The number of such arrangements = ie _ =10080 


(12) x (L2) 


Corresponding to each such arrangement, the four vowels in which A occurs twice 
and the rest are all different, can be arranged amongest themselves in 


l =12 ways. 


l2 
‘. Total number of arrangements in which vowels are always together 


=10080 x12 =1,20,960. 


Example 51: In the different permutations of the word ‘EXAMINATION are listed as in a 
dictionary, how many items are there in the list before the first word starting with E ? 


Solution: Starting with E, we have to find the number of arrangements of the 
remaining 10 letters, EXAMINATION, out of which there are 2I’s, 2N’s, 2 A's and the 
rest of the letters are distinct. 


The number of such arrangements is 


sU __ 453600 
[2x (L2) x (l2) 


Clearly, we will have to start the next word with E, so the required numbers of items 
=4,53,600. 


Example 52: How many numbers greater than a million can be formed with digits 
2,3,0,3,4,2,3? 


Solution: Any number greater than one million will contain all the seven digits. 
Now, we have to arrange seven digits, out of which 2 occurs twice, 3 occurs thrice and the 


rest are distinct. 


The number of such arrangements = le. = 420 


[2) x [3 


These arrangements will also include those which contain 0 at the million’s place. 
Keeping 0 fixed at the million’s place, the remaining 6 digits out of which 2 occurs twice, 
3 occurs thrice and the rest are distinct can be arrange in 


l6 


= ——— = 60 ways. 


(l2) x [3 


The number of required numbers = 420 — 60 =360. 
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6.2.2 Circular Permutations 


So far we have been considering the arrangements of objects in a line. Such permutation 
are known as linear permutations. 


In circular permutations, what really matters is the position of an object relative to the 
others. 


If we consider the linear permutations 
ABCD, BCDA, CDAB and DABC 
Then, clearly they are distinct. 


Now, we arrange A, B, C, D along the circumference of a circle as shown: 


C 


If we consider the position of an object relative to others then we find that the above four 
arrangements are the same. 


Theorem 6: The number of circular permutations of n different objects, is |n — 1. 


Proof: Fixing the position of an object can be done in n ways, as the position of anyone 
of them may be fixed. 


Thus, each circular permutation corresponding to n linear permutations depending upon 
where from we start. 


ln 


Since, there are |n linear permutations, it follows that there are = 
n 


ie., | (n -— 1) circular permutations. 
Theorem 7: The number of ways in which n persons can be seated round a table is [”- 1. 


Proof: Let us fix the position of one person and then arrange the remaining (x — 1) 
persons in all possible ways. Clearly, this can be done in | (1 — I) ways. 
Hence, the required number of ways = jn- 1. 


Theorem 8: Show that the number of ways in which n different beads can be arranged to 


form a necklace is Z (n-1). 


Proof: Fixed the position of one bead, the remaining (n -— 1) beads can be arranged in 


[2- lways. 
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In case of arranging the beads, there is no distinction between the clockwise and 
anticlockwise arrangements. 


. Í 
So, the required number of ways = 5 [(z— 1). 
Example 53: A gentleman has 6 friends to invites. In how many ways can he send invitation 
cards to them, if he has 3 servants to carry the cards ? 


Solution: Two friends can be invited by the same servant. Hence, in this problem, the 


servant is repeatable (R) and the friend is non-repeatable (NR). 
Number of ways = [RINE =3° =729. 
Example 54: A telegraph has 5 arms and each arm is capable of 4 distinct positions, including the 


position of rest. Find the total signals that can be made ? 


Solution: Two arms may have the same position, but same arms cannot have two 


positions at a time. Hence, position is repeatable (R), but arm is non-repeatable (NR) 
Number of ways = [R]N® =4° =1024 

But, in one case, when all the 5 arms will be in rest position, no signal will be made. 

Hence, the required number of signals = 1024 —1=1,023. 

Example 55: Find the number of way in which 3 friends can stay in 2 hotels ? 


Solution: Two friends can stay in the same hotel but same friend cannot stay in two 
hotel at a time. Hence, hotel is repeatable (R) and friend is non-repeatable (NR). 


Number of ways = [RME = [2 =8. 


Example 56: Find the number of ways in which Dr. Bansal can give 2 hotels to his 3 sons ? 
Solution: A son can get 2 hotels but a hotel cannot be given again and again. 
Hence, son is repeatable (R) but hotel is non-repeatable (NR). 

Number of ways = [RIME = (3)° = 9) 
Example 57: Find the number of ways in which 6 pictures can be hung on 9 picture nails (in a 
row) ? 


Solution: A picture can not hung on two picture nails at a time, but two pictures can 


hung on one picture nail. 
Hence, picture nail is repeatable (R) but picture is non-repeatable (NR). 


Number of ways = [RINE = (9). 
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Example 58: In how many ways can 8 students be arranged in 

G) aline 

(ii) a circle. 

Solution: 

(i) The number of ways in which 8 students can be arranged in a line = |8 
=8x7x6x5x4x3 x2 x1=40,320. 

(ii) | The number of ways in which 8 students can be arranged in a circle = [8 -l=7 


=7x6x5x4x3 x2 x1=5,040. 


Example 59: In how many ways can a party of 4 men and 4 women be seated at a circular table so 
that no two women are adjacent ? 


Solution: The 4 men can be seated at the circular table such that there is a vacant seat 
between every pair of men. 


The number of ways in which these 4 men can be seated at the circular table = |3 =6. 
Now, the 4 vacant seats may be occupied by 4 women in 

4P, =|4 =24 ways. 
"n The required number of ways = 6 x 24 =144. 


Example 60: In how many ways can 7 persons sit around a table so that all shall not have the 
same neighbours in any arrangements. 


Solution: 7 persons set at a round table in |6 ways. 


But, in clockwise and anticlockwise arrangements, each person will have the same 
neighbours. 


So, the required number of ways = 5 x [6 =360. 


| Exercise \ 
l. In how many ways can 5 person occupy 3 vacant seats ? 
2. In how many ways can 5 children stand in a queue. 


3. In how many ways can 4 different books, one each in Chemistry, Physics, Biology 
and Mathematics be arranged in a shelf ? 


4. Seven students are contesting the election for the presidentship of the students 
union. In how many ways can their names be listed on the ballot papers ? 


5. Find the number of different 4 letter words that can be formed from the letters of 
the word ‘NUMBERS’. 


= 
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12. 


13. 


14. 


15. 


16. 
17. 


18. 


Ten students are participating in a race. In how many ways can the first three 
prizes be won ? 


In how many ways can 6 boys and 5 girls be arranged for a group photograph if the 
girls are to sit on chairs in a row and the boys are stand in a row behind them ? 


[Hint: 5 girls can sit on chairs in a row in > Ps = |5 =120 ways 

Also, 6 boys can stand behind them in a row in 6p. = |6 =720 ways. 
The total number of ways =120 x 720 = 86400. 

How many words can be formed from the lettes of the word ‘SUNDAY ? 
How many of these begin with D ? 

In how many ways can the letters of the word ‘FAILURE’ ? 


Find the number of permutations of the letters of the word ‘ENGLISH’. How 
many of these begin with E and end with I ? 


In how many arrangements of word ‘GOLDEN’ will the vowels never occur 
together ? 


In how many ways can 6 books on Economics, 5 on History and 3 on Hindi be 
arranged on a shelf so that the books on each subject are always together ? 


A child has plastic toys bearing the digits 2, 2 and 5. How many three digit 
numbers can it make, using them ? 


There are three blue balls, four red balls and five green balls. In how many ways can 
they be arranged in a row ? 


How many arrangement can be made out of the letters of the word. 

(a) INDIA (b) ENGINEERING (c) EXAMINATION 

How many numbers can be formed with the help of the digits 5, 4, 3,5, 3 ? 
In how many ways can 6 persons be arranged in 

(a) aline (b) a circle 


There are 5 men and 5 ladies to dine at a round table. In how many ways can they 
seat themselves so that no two ladies are together ? 


Answers ` 


EEN a CTE e fef 


mso pa [oroa 


2,520 (a) 60 (b) 2,77,200 (c) 49,89,600 


17.| (a) l6 (b) 120 [4 x [5 =2,880 
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6.3 Combinations 


Combinations: Each of the different groups or selections which can be formed by taking 
some or all of a number of objects, irrespective of their arrangements, is called a 
combination. The total number of combinations of n distinct objects taking r (1 <r < n) at 


n 
a time is denoted by C (n, r) or ”C, or B 
where, ”C, is defined only when n and r integers such that n >r and n >0,r 20. 


Example: The combinations of 4 objects a,b,c, d taking 2 at a time are 


ab, ac, ad, be, bd, cd 


6.4 Difference between a Permutation and a Combination 


In combination, only a group is made and the order in which the objects are arranged is 
immaterial. In permutation, not only a group is formed, but also an arrangement in 
definite order is considered. 


Note: We used the word ‘arrangement’ for permutation and selections for combinations. 


Theorem 9: The number of all combinations of n distinct objects, taken r at a time, is 
given by 


"ne = Ln 
= 
Ir n-r 


Proof: Let the number of all combinations of n objects, taken r at a time, be x. 
Then, ICE 


Now, each combination contains r objects, which may be arranged amongst themselves is 
Lr ways. 


Thus, each combination gives rise to |r permutations. 


n x combinations will give rise to x x |r permutations. 


So, the number of permutations of n things, taken r at a time is x x Lr 


Hence, "P= xl r=" Gx 
nO "P, = Ln "p = Lu 


= (n-2)....n fact 
Note: We can write "G.= ee 
j 
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Theorem 10: Let 0 <r <n. Prove that "C, ="C, 


n=Ff" 


Proof: We have KONEI — a Ln nG 


-r p-0-0 ea i 


Theorem 11: To prove that 


Kar + Eri = DEG 


Proof: We have, 


"C, af $C, = Ln |n 
[Ir jn-7 [ral je-G-) 
ln Ln 


“Terr [ral n-rt+l 


_la(-r+) |[n-r 
~ Le |= +1) lz |n-r+1 


A l ahere» 
rjn-r 


Cea) Ce  nng 
=o, 


~[rjn-r4l |rjn+l-r 
Theorem 12: If 1 <r <n, prove that 


ae ae) ame =(n—-r+l)x "C 


Proof: We have 


n-l 
nx "DC a EO TEE a 
(qr -1x|m-)-(r-1) 
nxjn-l 
ir-lin—r 
la (n-r +1) 


"pr aixprarx(n=r +) 


ln 


=(n—r +l) x ———>—_ 
[Palx|parel 


=(n-r +1) x ”C,_] 


~m 
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Theorem 13: If andr are positive integers such that 1 <r < n then 


AG. n-r+l 


n CG, =j r 


Proof: We know that 


"Os lz and "C, a U = 
i lz pn=r = (fuixie-r el 


wos |n l r-ljn-r+l1 


"C, Le|n-r E 


Vat x@ar+l nr 
E rx r-l) jn-r 


E n-r+l 
= : 


Theorem 14: If 1 <r <n, prove that 


ig Gi + ss Cee = as Cone 
ly, ln 
eijear [eel eres 


s— H- -y | 
lz x(n-r)jn-r-l1 (r+l)|rxjn-r-l 


O Lpd 
lz jpn-r-1\(n-r r+l 


Ln m+l) (n+l) |n 


“ryn r q r) +1) (r+ |lrx(n-r)xjn-r-l1 


= jut _ [w+ =%tlo 
“tte etry 


A n n _ 
Proof: C, + "C41 = 


Theorem 15: Prove that "Cp = "Cy > p=qorp+qe=n. 


Proof: We have, 
Na n an 
C, = C = Chg 


=> P=q or p=n-q or prqen. 
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Evaluate 


Example 61: (i) 1% Cy, (ü) Cso (iii) ? C3. 


Solution: 
100 x99 x98 
. 100 100 100 
Coz = C =-~" C3 = ————— = 161,700 
(i) 97 100 -97 3 yo] 
(ii) 60 C60 =] 
(iii) 2 0 
3x2xl 
Example 62: Find n, if 
(i) "Cy = "Co (ii) E =15 


(ii) 77 Cae =% CGn (i) 7O : "O =9:2. 


Solution: 


(i) "C7="Cg and7#9=n=7+9=16. 


Gi) 7G... 215 4 —— 245 
n—4 |n—-(n-A) 
= = lt is 
n-4 |4 
> ln =15 x |4 jn- 4 
> n(n-l)(n-2)(n-3)=360 
> n=6. 


wee 24 24 
(iii) C45 =^ C3n-1 


> n+5=3n-lor24=n+5 + (3n-!l) 
=> n=3orn=5. 
(iv) We have, 
AC : "C3 =9:2 
2n n 9 
l2 [2n-2 |[Bin-3 2 


p acd pedea 


2 3 
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6(2n-l) 9 


(n—-)(n—2) 2 
=> 8n-4=3 —9n+ 6 or 31” -17n+10=0 
=> (1-5) Bn-2) =O orn=5,n= 2. 


But only n =5 satisfies the given equation. 


Hence, n=5. 


Example 63: If"P, =720 and "C, =120, find r. 


n 
Solution: We know that “C,. = T 
5 
72 720 
120 =— => y= — = |3 
Ir e=T59 78 
Hence, r=3. 


2” «1-3-5 ....(2n-1)] 


Example 64: Prove that 2"C, = T 
n 


|2" _ l2» 
[nj2n—n— (a 
(2n) (2n—1) (20-2) ......4-3-2-1 


Solution: ?”C, = 


(Li)? 
_ [2n (2n-2) (2n-4)....4-2] x [(2n- 1) (Qn-3)....5-3-]] 
7 (Ln)? 
_ 2" [n(n-]) (n-2)....2-1] [27 -1) (2n-3)....5-3-]] 
7 (Li? 
_ 2" [n[1-3-5....(2n-3) (2n-D] 
7 (Li 
_ 2" x [1-3-5 ....(2n—3) (2n-1)] 
a 


Example 65: If "C1 :"C,: "Cy 41 3:4:5 find nandr. 


a n 
Solution: eee and 7 C, mc 
CG, 4 r+l 5 
7 3 r+l 4 
=> = — E 
n-r+l 4 n-r 5 
> 3n-7r =-3 and 4n-9r =5 


27,n=62 


U 
i 
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Example 66: If Cg7 =" Cy, then find the value of x. [B.C.A. (Meerut) 2007] 


Solution: Given 


107 Gg; =!7C, a) 


We know that 
"Cy = "Cpr .-(2) 


Comparing (1) and (2), we get 
x=n-r =107-87=20 
Thus, x=20 


Example 67: If 1270 C117 + B Gg = “Cı 117 then find the value of x. 


Solution: Given 


1270 1270 x 
Gii7+ “Cie = Giz eal) 


We know that. 


NC: ie nc l = Dc. (2) 
Comparing (1) and (2), we get 
x=n+l>x=1270+1=1271 
Thus, #=1271 


Example 68: If x. rer =23:% 1B Gee. then find the value of x. 
[B.C.A. (Bundelkhand) 2010] 


Solution: Given 

x. Cz; =23 x8 Cos (1) 
Equation (1) can be re-written as: 

x. Cys =(78=56 + 1) Ces (2) 
We know that, 

n. UC, y = =r +1) "Cy_1y (3) 
Comparing (2) and (3), we get 
x=n=78 

Hence, r=78 
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A 15 15 _16 ; 
Example 69: If °C), + °Cy = ° Cjo Find x. [B.C.A. (Avadh) 2008] 


Solution: We have 


Bgy geg aa) 
Equation (1) can be re-written as: 
DC +C, =P DCi (2) 
We know that 
"C, t Ore =C ...(3) 


Comparing (2) and (3), we get 
vert+l=ll4+l=12 
Hence, x=12 


Example 70: 


In how many different ways can all of 5 identical balls be placed in the cells shown above such that 
each row contains at least 1 ball ? [B.C.A. (Purvanchal) 2010] 
Solution: The following combinations for the given rows can be made: 


(I) 3 balls in first row can be put in 3 C3 ways 
1 ball in second row can be in 3 C] ways 
l ball in third row can be put in 3 Cı ways. 


z. In this case the total number of ways the balls can be put in the squares 
=% x? C eG 


=1x3x3=9 
Similarly, we have 


(I) 3C x3Cy x3G =3x3 x3 =27 
(I) 2C, x3C3 x3G =3x1x3 =9 
(V) 3C,x3G,x3C3 =3x3x1l=9 
(V) 30) x3G,x3C) =3x3 x3 =27 
(VI) 3C x3Cy x3) =3 x3 x3 =27 


Total number of arrangements 


=94274+9494274+27 
=108 
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Example 71: In how many ways can a committee of 5 members be selected from 6 men and 5 
ladies, consisting of 3 men and 2 ladies ? 


Solution: 3 men out of 6 and 2 ladies out of 5 can be selected in 


6x5x4 5x4 
X2 XT x 5200 ways. 


OC. ee = ——__ X 
3x2xl 2x1 


Example 72: A committee of 5 is to be formed out of 6 men and 4 ladies. In how many ways can 
this be done, when 


(a) atleast 2 ladies are included 
(b) atmost 2 ladies are included. 
Solution: 
(a) We have to make selection of 
(i) (2 ladies out of 4) and (3 men out of 6) or 
(ii) (3 ladies out of 4) and (3 men out of 6) or 
(iii) (4 ladies out of 4) and (1 man out of 6) 
The number of ways of these selections are 
Case (i) £C) x C3 =6 x20 =120 
Case (ii) *C3 x °C) =4 x15 =60 
Case (iii) 4C} x °C, =1x6 =6. 
Total number of ways to select such a committee =120 + 60 + 6 =186 
(b) We have to make a selection of 
(i) (l lady out of 4) and (4 men out of 6) or 
(ii) (2 ladies out of 4) and (3 men out of 6) 
The number of ways of these selections are 
Case (i) 4C] x °Cy =4 x15 =60 
Case (ii) £C) x C3 =6 x 20 =120 
Case (iii) number of lady is selected then 
Number of ways of selecting committee = 6 c5 =6 


n the required number of ways = 60 + 120 =180. 
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Example 73: An examination paper containing 12 questions consists of two parts, A and B, part 
A contains 7 questions and part B contains 5 questions. A candidate is required to attempt 8 
questions, selecting atleast 3 from each part. In how many ways the candidate select the questions ? 


Solution: We have, 
(i) (3 out of 7 from A) and (5 out of 5 from B) or 
(ii) (4 out of 7 from A) and (4 out of 5 from B) or 
(iii) (5 out of 7 from A) and (3 out of 5 from B) 
The number of ways of these selections are 
Case (i) “C3 x °C; =35 x1 =35 
Case (ii) ’Cy x °C4 =35 x5 =175 
Case (iii) “Cs x °C =21x10 =210 
n the number of ways of selecting the questions = (35 +175 + 210) =420. 


Example 74: If 20 lines are drawn in a plane such that no two of them are parallel and no three 
are concurrent, then in how many points will they intersect each other ? 


Solution: Since, no two lines are parallel and no three pass through the same point, 
their point of intersection, (i. e., number of ways of selecting two lines from 20 lines) 


Example 75: How many lines can be drawn through 21 points on a circle ? 


Solution: One line can be drawn by joining any 2 points, from 21 given points. 


21x20 


Total no. of lines = en = = 210. 


Example 76: There are 15 points in a plane, no three of which are collinear. Find the number of 
triangles formed by joining them. 


Solution: One triangle is formed by selecting a group of 3 points from 15 given points. 


No. of triangles = DE = “+ =455. 
x2 x 


Example 77: Everybody in a room shakes hands with everybody else. The total number of 
handshakes is 66. How many people are there in the room ? 


Solution: Let there are n persons in the room. 


Total no. of handshakes is same as the number of ways of selecting 2 persons 
among n persons, i e., "C3. 
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"Cy =66 (given) 


=j 
ON) 66 2 yp 1890 


> (n-12)(n+11)=0 =n=12 orn=-I1 
But, no. of persons cannot be negative 
n=12. 


Example 78: Determine the number of 5 cards combination out of a deck of 52 cards, if there is 
exactly one ace in each combination. 


Solution: One ace can be selected from 4 aces and 4 other cards can be selected from 
remaining 48 cards. 
No. of ways = 2G x aed 
38 x47 x46 x45 
4x3x2xl 
= 7,78,320. 


Example 79: From a class of 25 students, 10 are to be chosen for an excursion party. There are 3 
students who decide that either all of them will join or none of them will join. In how many ways can 
they be chosen ? 

Solution: There are two possibilities: 


Case I: When the 3 particular students join the party. 


In this case, we have to select 7 more students from the remaining 22 students. This can 
be done in 2” C7 ways. 

Case II: When the 3 particular students do not join the party. 

In this case, we have to select 10 students from the remaining 22 students. 


This can be done in 2? Cig ways 
So, total number of ways = 20, + = Ga 


=170544 + 646646 
_ 22x19 x17x3 x8 


(720 + 2,730) =8,17,190. 
10x9x8& 


Example 80: A committee of 12 is to be formed from 9 women and 8 men. In how many ways this 
can be done, if atleast 5 women have to be included in a committee ? In how many of these committees : 


(a) the women are in majority, (b) the men are in majority ? 


Solution: The committee can be selected in the following way: 


| 9Women | 5 | 6 | 7 | 8 | 9 | 


“eco Ee 


256 Elements of Statistics 


Total, number of ways = °C; x 8C} + ?Cg x Ce + °C; x 8C; +° Cg x PCy 
+ Ko. x 803 
=1,008 + 2,352 + 2,016 + 630 +56 
= 6,062 
Number of committees in which women are in majority 


=2,016 + 630 + 56 =2,702 


Number of committees in which men are in majority = 1,008. 


Example 81: If m parallel lines in a plane are intersected by a family of n parallel lines, find the 
number of parallelograms formed. 


Solution: A parallelogram is formed when two lines from the set of m parallel lines are 
intersected by the other 2 lines from another set of parallel lines. 
Hence, the number of parallelograms formed in this case 
= "Cy x "Cy 

m (m—1) yt (n-1) 

2 2 
mn (m —1) (n-1) 

F ; 


Example 82: A student is allowed to select at most nbooks from a collection of (2n + 1) books. If the 


total number of ways in which he can select at least one book is 255, find the value of n. 


Solution: The student may select at most n books out of (2n + I) books. Thus, according 


to the given condition, 
le + nE ape Zate CORINE: S mes C =255 


= se BONG De 4 ANC, +... OG =255 
E i 
z S [eer i anal 4 pertle + anto Fal 
T emg, p? ceeds Sia =255 
i 
T 5 [e"+tc 4 es ee a Clg T ia oe) to, 
+ pme + ial SMe =255 (2 "C, =™"Cm-r) 
5 a a a ern 
l ‘ es 
= A eG, + cau 4 2n iG, Fent n Casi) 


=r Ea iia i] =255 


> xen -(1+1)]=255 ("Cy + "Cl + "Cy t+... + "Cy = 2") 
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=> 2?”+1_3 =510 
=> gel -512 
5 92n+1 =929 
> 2n+1=9 
=> n=4. 


Example 83: A committee of 5 is to be formed out of 6 gents and ladies. In how many ways this 
can be done when 


(i) atleast 2 ladies are included, 


(ii) at most 2 ladies are included. 


Solution: Total gents = 6, Total ladies =4 


l. At least 2 ladies out of 4 ladies are included in the following ways 


(i) 3 gents out of 6 gents + 2 ladies out of 4 ladies = 6C, x 4C) ways 
8 § 3 2 Way’ 


(ii) 2 gents out of 6 gents + 3 ladies out of 4 ladies = Te x 1C 
Gii) 1 gent out of 6 gents + 4 ladies out of 4 ladies = oe! x tC 
Thus, the total number of ways of forming the committee 
= C3 xC, +C, x C3 + OQ x *Cy =120 + 60 + 6 =186 
2. At most 2 ladies out of 4 ladies are included in the following ways: 
(i) | No lady and 5 gents = ae ways 
(ii) 1 lady and 4 gents = ta x to ways 
(iti) 2 ladies and 3 gents = T x OCs ways 
Thus, the required ways of forming committee 
=G tx at ko 
=6 +60 +120 =186. 
Example 84: A polygon has 44 diagonals. Find the numbers of its sides. 


Solution: The number of diagonals of n sided polygon 


="Cy _p p- #023) 


2 
Given that: É 0—5 =44 
> 1 -3n =88 
> w -3n-88=0 
> (n-11)(n+8)=0 
=> n=11 Ce n>0) 
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Example 85: A box contains 5 different red and 6 different white balls. In how many ways can 6 
balls be selected so that there are at least two balls of each colour ? 


Solution: Total red balls =5 


Total white balls =6 


6 balls can be selected from the box so that there are at least two balls of each colour in 
the following ways: 


(i) 2 red balls and 4 white balls = °C) x °Cy ways 
(ii) 3 red balls and 3 white balls = ae x oc. ways 
(iii) 4 red balls and 2 white balls = °Cy x °C, 
Hence, the required ways = a x oC, + Ta x oC. + IC x °C 
=150 +200 +75 =425 


Example 86: In how many ways can 21 identical books on English and 19 identical books on 
Hindi be placed in a row on a shelf so that two books on Hindi may not be together ? 


Solution: Let E denote the position of an English book and H of Hindi book on a shelf. 


Since no two books of Hindi must be together on the shelf therefore, we first arrange all 
books of English in a row. Since all the books of English are identical so they can be 
arranged in a row in only one way as shown below: 


HEHEHE H...... HEHE 
As there are 21 books of English so there will be 22 places for Hindi books. 
Now we have 19 identical books of Hindi, therefore out of 22 places, 19 places. 


[22 _ 22x 21x20 


ee =1540 
[19 [22-19 "3 x2xl 


Total no. of ways = a Oe = 


Example 87: In how many ways can a cricket team be selected from a group of 25 players 
containing 10 batsmen, 8 bowlers, 5 all-rounders and 2 wicket keepers ? Assume that the team of 11 
players requires 5 batsman, 3 all-rounders, 2 bowlers and 1 wicket keeper. 


Solution: The selection of team is divided into 4 phases: 


(i) 5 batsman out of 10 batsman can be selected in 19 C5 ways. 

(ii) 3 all-rounders out of 5 all-rounders can be selected in > C3 ways. 
(iii) 2 bowlers out of 8 bowlers can be selected in a Cy ways. 

(iv) 1 wicket keeper out of 2 wicket keepers can be selected in 2 Cı ways. 


. Team can be selected in ioe x `C x e x 20 ways = 1,41,120 ways. 
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Example 88: In how many ways can a committee of 3 ladies and 4 gentlemen be appointed from a 
group consisting of 8 ladies and 7 gentlemen ? What will be the number of ways, if Aishwarya of this 
committee refuses to serve in a committee in which Salman is a member ? 


Solution: 


(i) 3 ladies can be selected from 8 ladies in a a ways and 4 gentlemen from 7 


entlemen in ‘C4 ways. 
8 4 Way 


"n No. of ways in which the committee can be appointed 
=8C, x C4 =56 x35 =1,960. 

(ii) First we find the no. of committees of 3 ladies and 4 gentlemen in which both 
Aishwarya and Salman are members. In this case, we can select 2 other ladies from 
the remaining 7 in ee ways and 3 other gentlemen from remaining 6 in 6 C3 ways. 

No. of ways in which both Aishwarya and Salman are always included 
= "Cy x C3 =21x 20 = 420. 
Hence, the required no. of committees in which Aishwarya and Salman do not 
serve together 
= 1,960 —420 =1,540. 


Example 89: In how many ways a committee of 5 persons can be formed out of 7 Indians and 4 
Pakistanis when the committee contains: 


(i) All Indians 

(ii) At least two Pakistanis 

(iii) At least two Indians 

(iv) Not more than two Pakistanis 


(v) Not more than two Indians [B.C.A. (Meerut) 2008] 


Solution: 


(i) When committee consists of all its members as Indians. The number of ways 
selecting the committee = es x t =21 


(ii) When at least two Pakistanis are selected, then the committee can be selected as 
follows: 


Case I: When 2 Pakistanis out of 4 and 3 Indians out of 7 are selected. Then 
number of ways selecting the committee = E x es =210 


Case II: Number of ways selecting 3 Pakistanis and 2 Indians = 4C x iG; = 84 
Case III: Number of ways selecting 4 Pakistanis and 1 Indian = er x fas S 


Total number of ways selecting the committee 


=210 +84 +7=301 
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(iii) 


(iv) 


When at least two Indians are selected. Then number of ways selecting the 
committee 


=C; RCs +10 x *C +704 x7G, + (Es RAG 
=84+210+168+21=483 


When not more than two Pakistanis are selected. Then number of ways selecting 


the committee = Ci x en + 4c x "Ca 
=140 +840 =980 


When not more than two Indians are selected. 


Then number of ways selecting the committee 


='C, x4, + "Cy x4C, =74+84 =91 


\ Exercise } 

Verify that 8 C4 + C3 =9Cy. 
G) If"C7 ="Cs, find n. 
(ii) If 29°C, =2°C, , ¢, find r. 
Gii) If "P. =1680 and "C, =70, find nand r. 
(iv) If"*!C, 4): "C, =11:6 and "C, :""!C,_, =6:3, find mand r. 
If 16C, :19C,_] =10:5 find r. 
If 18C, =18C, 9, find "C3. 
Find 25 Co = A Oy, 
In how many ways can a cricket team be chosen out of a batch of 15 players, if 
(a) there is no restriction on the selection 
(b) a particular player is always chosen 
(c) a particular player is never chosen. 

[Hint: (a) °C, (b)'4Qq (©) 14G] 


Out of 5 men and two women, a committee of 3 is to be formed. In how many ways 
can it be formed if at least one women is to be included ? 
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10. 


l1. 


12. 


13. 


14. 
15. 


16. 


17. 


18. 
19. 


20. 


21. 


From a class of 32 students, 4 are to be chosen for a competition. In how many 
ways can this be done ? 


In how many ways can a student choose 5 courses out of 9 courses if 2 courses are 
compulsory for every student ? 


A question paper has two parts, part ‘A’ and part ‘B’, each containing 10 questions. 
If the student has to choose 8 from part ‘A’ and 5 from part ‘B’. In how many ways 
can he choose the questions ? 


From a class of 25 students, 10 are to be chosen for an excursion party. There are 3 
students who decide that either all of them will join or none of them will join. In 
how many ways can the ten be chosen ? 


A cricket team of 11 players is to be selected from 16 players including 5 bowlers 
and 2 wicket keepers. In how many ways can a team be selected so as to consist of 
exactly 3 bowlers and 1 wicket keeper ? 


(i) How many straight lines can be obtained by joining 12 points, 5 of 
which are collinear ? 


(ii) How many triangles can be obtained by joining 12 points, five of which are 
collinear ? 


A polygon has 44 diagonals. Find the number of its sides. 


In a village, there are 87 families of which 52 families have at most 2 children. In a 
rural development programme 20 families are to be chosen for assistance, of which 
at least 18 families must have at most 2 children. In how many ways can the choice 
be made ? 


A bag contains 4 red, 3 white and 2 blue marbles. 3 marbles are drawn at random. 
Determine the number of ways of selecting at least 1 white marble in this selection. 


In how many ways can 5 white balls and 4 black balls be arranged in a row so that 
no two black balls are together. 


A man has 6 friends. In how many ways can he invite one or more to a party ? 


A boy has 3 library tickets and 8 books of his interest in the library of these 8, he 
does not want to borrow chemistry part II, unless chemistry part I is also borrowed. 


Out of 6 teachers and 8 students, a committee of 11 is to be formed. In how many 
ways can this be done, if the committee contains 


(i) exactly 4 teachers, 
(ii) at least 4 teachers. 


In how many ways can a cricket team be selected from 17 players in which 5 
players can bowl ? Each cricket team must include 2 bowlers ? 
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Answers 


(i) 12) (ii) 7 (ili) n=8,r =4 (iv) n=10,r=5 
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Chapter 


Probability 


7.1 Introduction 


There are two approaches to the theory of probability viz. (a) Classical approach and (b) 
Set-theoretic or Axiomatic approach. We begin the chapter with the classical approach. 


7.1.1 Classical Approach to Probability 


(A) Random Experiment and Events: A random experiment may be defined as an 
experiment which when repeated under essentially identical conditions does not 
give unique results but may result in any one of the several possible outcomes. 
These outcomes are known as events or cases. Events are denoted as A, B, C etc. 


For Example: 
(i) Getting a head (H) or a tail (T) is an event when we toss a coin. 
(ii) Getting any of the six faces 1, 2, 3, 4, 5, 6 is an event we throw a die. 


(iii) Getting an ace or a king or a queen is an event when we draw a card from a 
pack of well-shuffled cards. 


(B) Exhaustive Events: The total number of possible outcomes in a random 
experiment are known as exhaustive events. 
For Example: 
(i) There are 2 exhaustive events viz. head (H) and tail (T) when we toss a coin. 


(ii) There are 6 exhaustive events when we throw a die. 


(C) 


(D) 


(E) 
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(iii) 


(iv) 
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When we throw two dice the exhaustive number of events is 62 = 36 given as 


follow: 

(1,1), (1,2), (1,3), (1,4), (1,5), (1,6); 
(2,1), (2,2), (2,3), (2,4), (2,5), (2,6); 
(3,1), (3,2), (3,3), (3,4), (3,5) 4 (3,6); 
(4,1), (4,2), (4,3), (4,4), (4,5), (4,6); 
(5,1), (5,2), (5,3), (3,4), (5,5), (5,6); 
(6,1), (6, 2), (6,3), (6,4), (6,5), (6,6); 


When we draw two cards from a pack of playing cards, the exhaustive 


number of events is 27 Cy, since 2 cards can be drawn out of 52 cards in P2 Cə 


ways. 


Favourable Events: The events which cause the happening of a particular event A 


are called the favourable events to the event A. 


For Example: 


(iii) 


There are three favourable events for the occurrence of an even number (or 


an odd number) in the throwing of a die. 


When we draw a card from a pack of cards, there are 4 favourable events for 
the drawing of an ace ; there are 12 favourable events for the drawing of a 
face card (king, queen, jack). 


When we throw two dice together, the number of events favourable to 
getting a sum of 9 is 4 given by the following combinations : 


(3, 6), (4, 5), (6, 3), (5, 4) 


Mutually Exclusive Events: Such events where the occurrence of one rules out 


the occurrence of the other, are called mutually exclusive events. 


For Example: 


(i) 


(ii) 


(iii) 


In tossing a coin there are two mutually exclusive events, for if head comes in 
a trial, then tail cannot come in the same trial or vice versa. 


In throwing a die there are 6 mutually exclusive events, for if any of the six 
faces comes, then the possibility of other faces in the same trial is ruled out. 


There are 52 mutually exclusive events in drawing a card from a pack of 
cards. 


Equally Likely Events: The events are said to be equally likely if none of them is 


expected to occur in preference to other. 


For Example: 


(i) 
(ii) 


In tossing a coin, there are 2 equally likely viz. H and T. 


In tossing a die, there are 6 equally likely events viz. 1, 2, 3, 4, 5, 6. 
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7.1.2 Classical Definition of Probability 


If there are n exhaustive, mutually exclusive and equally likely events out of which m are 
favourable to the happening of an event A, then the probability of the happening of A, 
denoted by P[A] is defined as 

Favourable number of cases _ m 


Pa ee ee eee 


Exhaustive number of cases n 
Remarks: 


l. It is clear that if m cases out of n exhaustive cases favour the happening of an event 
A, then n — m cases favour that the event A will not happen. Thus, the probability 
that the event A will not happen denoted by P [A], is given by 

paje ZPA] 
n n 
P[A]+ P[A]=1 (1) 
[B.C.A. (Meerut) 2003] 
m 


2; Since 0 <m <n, so0<—<1l 
n 


Hence, 0 < P [A] <1, for any event A. 
If P [A] =0, then A is called an impossible or null event. 
If P [A]=1, then A is called a certain or sure event. 


32 The probability P[A] of the happening of an event A is also known as the 
probability of success, denoted by p and the probability P[A] of the 


non-happening of the event A is known as the probability of failure denoted by 4. 
From (1), it follows that: 
p+q=LOspsl Osqsi). 


4. If A and B are two events, then the probability of the happening of A or B (i.e. at 
least one of the two events) is denoted as P [A + B] or P [A V B]. The probability of 
the simultaneous occurrence of two events A and B is denoted by 
P[AB] or P[A aB]. 

5. The following results from permutations and combinations are very useful: 

(i) nl=n(n-))...3,2, Lh n!=n(n-l)!=n(n-I) (n-2)!ete. 


(ii) The number of permutations of n different things taken r at a time is 
n! 


(iii) The number of combinations of n different things taken r at a time is ”C, or 


n 
where 
; 


r) ri(n-r)! 
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For Example: 


6 6! 6.5 10 10.9.8 
= °C) =—— == =15, =l0C, = 19 
214! 2.1 3 53, 


=270725 


52) 59, _ 52.51.50.49 
T ME te 


\ Solved Examples ¥ 


Example 1: What is the chance that a leap year selected at random will contain 53 sundays? 
Solution: A leap year consisting of 366 days has 52 complete weeks and 2 days extra. 

The following are the possible combinations for there two extra days: 

(i) | Sunday and Monday 

(ii) Monday and Tuesday 

(iii) Tuesday and Wednesday 

(iv) Wednesday and Thursday 

(v) Thursday and Friday 

(vi) Friday and Saturday and 

(vii) Saturday and Sunday 

In order that a leap year selected at random should contain 53 Sundays, one of the two 

extra days must be Sunday. Since out of the above 7 possibilities 2 viz; (i) and (vii) are 


favourable to this event, the required probability == 


Example 2: (i) Two cards are drawn at random from a well-shuffled pack of 52 cards. Find the 
probability of getting 2 aces. 


Solution: Total number of cases in which 2 cards can be drawn out of 52 cards is 


n= aC, There are 4 aces in a pack of 52 cards and out of 4 aces, 2 aces can be drawn is 


m= aC ways. 


4 
Required Probability = ae ewe = ae = en 
n C) 52.51 221 


(ii) In a single throw of ‘2’ dice, what is probability of obtaining a total 10 ? 
[B.C.A. (Meerut) 2003] 


Solution: Total numbers of cases =6 x 6 =36 


Find problem of obtaining a total of 10. 
Favourable outcome = (4, 6), (5,5), (6,4) 
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Total number of favourable cases = 3. 


3 
Th bability of throwing 10 = — = — 
e probability of throwing 36° DD 


Example 3: A bag contain 6 white, 7 red and 5 black balls. Find the chance that three balls drawn 


at random are all white. 


Solution: 
18x17 x16 
Total number of cases = Be = RG 
3x2xl 
6x5x4 
Favourable number of cases = °C, a XT 20 
3x2xl 
20 5 
Required probability = — = —— 
equired probability = -<77 = 703 


Example 4: Find the probability of getting an even number in a throw of a single die. 


Solution: A die has six faces bearing numbers 1 to 6. 
Total number of exhaustive cases =6 i.e. n=6 
out of these six numbers, there are three even numbers viz. 2, 4 and 6. 


Total number favourable cases =3 i.e. m=3. 


Required probability = "z > = 
1 


l 
7 2 
Example 5: From a pack of 52 cards, two cards are drawn at random. Find the chance that one is 


a king and the other a queen. 
Solution: Total number of cases is n = ae 


Since there are 4 kings and 4 queens, so the number of favourable cases is m = aC; x i On 


4qx4*q 44 TE. 
526, 52.51 663 


Required probability = = 
n 


Example 6: Four persons are chosen at random from a group containing 3 men, 2 women and 4 
children. Obtain the chance that at the most 2 of them are children. 

[B.C.A. (Meerut) 2002] 
Solution: We have a group of 3 men, 2 women and 4 children and we have to choose 4 


persons in a way. Four persons can be chosen in the following ways: 


4 3 
(i) 1 child, 2 man, 1 woman probability = otis a = 
Cy 
4 3 
(ii) 1l child, lman, 2 women probability = aos’ 
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2 2 
(iii) 2 children, 1 man, 1 woman probability = 22 = X Gi 
4 
4 3 2 
(iv) 0 child, 3 men, 1 woman probability = — — xG 
4 


4 
(v) O child, 2 men, 2 women probability = 


Since only one case is possible so required probability will be the sum of all these 


cases = — 


126 


Example 7: A ball is drawn at random from a box containing 6 red balls, 4 white balls and 5 blue 
balls. Determine the probability that the ball drown is: 


(i) red (ii) white 
(iii) blue (iv) not red 
(v) redor white. [B.C.A. (Meerut) 2009] 


Solution: Let R, W and B denote the events of drawing a red ball, a white ball and a blue 
ball respectively. 


Number of favourable outcomes _ n(R) 


(i). PR) = A 
Total number of outcomes n(S) 
6 6 24 
“64445 15° 5 ` 
(ii) PW) — 2 — =" 2036 
64+44+5 15 
(iii) P(B)= 2 -2 -1033 


ise e 2006 
I5 3 


Example 8: If four ladies and six gentlemen sit for a photograph in a row at random what is the 


probability that no two ladies will sit together ? 


Solution: 4 ladies + 6 gentlemen = 10 persons is all 
They can sit in a row in Hry =10! ways 


Then, n(S) =10! 
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Now, if no two ladies are to be together, the ladies have 7 position, 2 at ends and 5 
between the gentlemen arrangement L, G, L, Go, L, G3, L, Gy, L, G5, L, Ge, L. The 4 ladies 
can occupy these 7 seats in P, ways and 6 gentlemen can occupy 6 seats in 6p. ways. 


Therefore, the total number of these arrangements = p x OP, ways. 


Therefore, the required probability = a 
otal no. of outcomes 


Example 9: In a class there are 13 students 5 of them are boys and the rest are girls. Find the 
probability that two students selected at random will be both girls. 


Solution: Two students out of 13 can be selected in !3 Cy ways and two girls out of 8 
can be selected in 3e ways. 


8 
DRE =036 


Therefore, ired probability = 
erefore, required probability = y5 C, 39 


Example 10: A box contains 5 white balls, 4 black balls and 3 red balls. Three balls are drawn 
randomly. What is the probability that they will be (i) white, (ii) black, (iii) red ? 
Solution: Let W, B and R denote the events of drawing three white, three black and 
three red balls respectively. 
5 5 
: n(W C: C: l 
(i) pw) =") -___©3s__ = 
nS) (5+4+3)Cs C3 22 


n(B) 4C 1 
nS) 1C 55 


nR) °C 1 
nS) !?C, 1320 


Example 11: A bag contains 6 red, 5 white and 4 black balls. Two balls are drawn. Find the 
probability that none of them is red. [B.C.A. (Rohilkhand) 2006] 


Solution: Total number of balls =6+5+4=15 


15x14 
xl 


Number of exhaustive events = Bes = =105 


Total number of non red balls =5 +4 =9 


Number of favourable events = 2C 


9x8 _36 
2x] 
36 12 


Required probability = TR = a5 
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Example 12: If from a lottery of 30 tickets, marked 1 to 30 four tickets be drawn, what is the 
probability that those marked 1 and 2 are among them ? [B.C.A. (Meerut) 2010, 2009] 


Solution: Four tickets can be drawn out of 30 in 30 Cy ways. If the ticket marked 1 
and 2 are to be always included, then 2 more can be drawn out of the remaining 28 in 
28C») ways. 
28C, 

30 C4 


28x27 
_ 2 xl 
30x29x28x27 
4x3x2xl 
_ 28x27x4x3x2x] _ 2 
~ 30x29 x28x27x2x1 145 


Required probability = 


Example 13: A drawer contains 50 balls and 150 nuts. Half of the bolts and half of the nuts are 
rusted. If one item is chosen at random, what is the probability that it is rusted or in a bolt ? 
[B.C.A. (Garhwal) 2010] 


Solution: Let ‘A’ = {the item is rusted} 
and ‘B’ = {the item is a bolt} 


Here, A and B are non-mutually exclusive events 
P(A UB) =P(A)+P(B)-P(ANB) 


_100 50 25 _150-25 _125 _5 
8 


~ 200 200 200 200 


00 


N 


Required probability = > 


Example 14: In a single throw of two dice, find the probability that neither a doublet (same 
number on both the dice) nor a total of 9 will appear. [B.C.A. (Lucknow) 2008, 2004] 


Solution: Number exhaustive cases = 6 x6 = 36 cases favorable to a doublet are: 
(1,1), (2,2), (3,3), (4,4), (5,5), (6,6) i.e. 6 ways 


P (probability of getting a doublet) = £ = Z cases favorable to a total of 9 cases are 


(3,6), (6,3), (4,5), (5,4) i.e. 4 ways 
l 


P (probability of getting a total 9) = — = 9 


Since the two events are mutually exclusive 
P(a doublet or a total of 9) = P (a doublet) + P (a total of 9) 
1 1.342 _ 5 


Zp 
6 9 18 18 
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P(neither of doublet nor a total of 9) =1—P (a doublet nor a total of 9) 
5 _ 18-5 _13 


18 18 18 


Example 15: A class consists of 80 students, 25 of them are girls and 55 boys, 10 of them are rich 
and the remaining poor, 20 of them are fair complexioned. What is the probability of selecting a fair 


complexioned rich girl ? [B.C.A. (Lucknow) 2006; B.C.A. (Kanpur) 2005] 


Solution: Probability of selecting a fair complexioned person 


Probability of selecting a rich person 


l 
8 
Probability of selecting a girl = 23 = 2 

8 16 
Since these events are independent of each other, therefore the probability of selecting a 


Le? a =0.009 


i 
fai lexioned rich girl =—xX=x— =—_ 
air complexioned rich gir rigs ar SE 


Example 16: A bag contains 6 white and 9 black balls. Four balls are drawn at a time. Find the 
probability for the first draw to give four white and second draw to give four black balls in each of the 
following cases: 


(i) The balls are replaced before the second draw 
(ii) The balls are not replaced before the second draw 


Solution: 


(i) When the balls are replaced before the second draw. Here two draws are 
independent 


P(A ^B) = P(A)xP(B) 


6C, 
Probability of drawing 4 white balls = P(A) = 2 
: ISCy 
Pe ; 9 C4 
Probability of drawing 4 black balls = P(B) = 
15 C4 
Now, P(A ^B) = P(A) x P(B) = eo 
15Cy 15C, 
6x5 x4x3 y 9x8x7x6 _ 6 


~ I5xl4x13x12 l5xl4x13xl12 5915 
(ii) When balls are not replaced before the second draw. 


Here two events are not independent. 
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Probability of drawing 4 white balls = = 
15C4 91 


Now, the balls drawn have not been replaced, therefore the bag contains 2 white 
and 9 black balls. 


Probability of drawing 4 black balls 
9C4 _21 


P (BIA) = 
lc. 55 


ISG, Wc, 51 55 715 


Example 17: An win contains 7 red and 4 blue balls. Two balls are drawn at random with 
replacement. Find the probability of getting bag. [B.C.A. (A.M.U.) 2011] 


(i) 2 red balls (ii) 2 blue balls (iii) are red and one blue ball. 
[B.C.A. (Rohtak) 2010] 


Solution: 
G) P (2 red balls) = ax 749 
IT IL 121 
ay PO huspig=e sa 
ll ll 121 
(iii) P (one red and one blue ball) = 1 TEN = xL 
ll IL 11 Il 
28 28 56 


= —— + — = — 
121 121 121 


Example 18: Two dice are thrown. Find the probability that a multiple of 2 occurs on one dice and 


multiple of 3 occurs on the other. [B.C.A. (Agra) 2007] 


Solution: Two dice can be thrown in 6 x6 =36 days 


Multiples of 2 are 2, 4, 6 and multiple of 3 are 3, 6. 


Favourable cases for getting multiple of 2 on one dice and multiple of 3 on the other are: 
(2,3),(2,6), (4,3), (4,6), (6,3),(6,6), (6,4), (6,2), (3,2),(3,4) and (3,6) i.e. 11 ways. 


required probability = = 
Example 19: In a single throw of three dice, find the probability of getting a total of 17 or 18. 
[B.C.A. (Meerut) 2008] 
Solution: Three dice can be thrown in 6 x6 x6 = 216 ways 4 total of 17 can be obtained 
i (5,6,6),(6,5,6),(6,6,5) 
A total of 18 can be obtained as (6,6,6) 


No. of class favorable to a total of 17 or 18 are 4 


P (a total of 17 or 18) = 4 l 
216 54 
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Example 20: Two dice are thrown. Getting two numbers whose sum is divisible by 4 or 5 is 
considered a success. Find the probability of success. [B.C.A. (Agra) 2005] 


Solution: To obtain sum divisible by 4 we must get a total of 4 or 8 or 12 and to obtain 
sum divisible by 5, we obtain a total of 5 or 10. Hence the probability of success is P(4 or 
8 or 12 or 5 or 20) = P(4) + P(8) + P (12) + P (5) + P(10). 


A total of 4 can be obtained as (1,3), (3,1), (2,2) i.e. 3 ways 


3 
P(4) => 
(4) 36 
5 l 4 3 
Similarly, P(8) =—,P(12) =—, P(5) = — and P(10) = — 
a rg gt Cale ag Olea eae 
Hence the required pepe. 2 ge 
36 36 36 36 36 
_16_4 
36 9 


Example 21: A bag contains tickets numbered from 1 to 20. Two tickets are drawn. Find the 
probability that both the numbers are prime. [B.C.A. (Kurukshetra) 2011, 2008] 


Solution: Total numbers of cards = 20 


The probability of getting 2 cards from 20 = Ne 
The prime number in between | to 20 = 8 
The probability of getting both number to be prime = s0 


8C, 8x7 28 


. required probability =e, = ITE S 


Example 22: Two cards are drawn from a pack of 52 cards, one after another without replacement. 
Find the chance that one of these is an ace and other is queen of opposite shade. 


Solution: There are two possibilities: 


(i) The first card drawn is an ace and the second drawn is a queen of opposite shade 


(colour). The probability of this compound event = Š x 2 


(ii) The first card is a queen and the second is an ace of opposite shade. The probability 


of this event = — ee 
52 51 
Hence, the required probability. 
4 2 4 2 2 2 
=— x2 + — XS = — + — 
52 51 52 51 13x51 13x51 
2 2 4 
+ 


= 663 663 663 
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Example 23: Tim Tim and Vishu appear for an interview for two vacancies. The 


probability of Tim Tim’s selection is > and that of Vishu’s selection is ~ Find the 


probability that only one of them will be elected. 


Solution: P (Tim Tim selection) -2 
; - 1 2 
P(Tim Tim selection) =1-~ == 
3 3 
P (Vishu'’s selection) = - 


required probability =—x £ + 


3 3 
4 2 6 
— + — = — 
15 15 15 


7.1.3 Theorem of Total Probability 


If n events Aj, AÁ ,..., A, are not mutually exclusive, then the probability of the 
happening of at least one of the events is the sum of the probabilities of the individual 
events. In symbols 
P [A + Ag +... + Ay] = p [Ay] + P [A] +... +P [Ay] 
Or 
P[A, U Ag U...U Ay] =P [A)] + P [Ag] +...+P [Ay] 


Proof: Let N be the total number of mutually exclusive, exhaustive and equally likely out 
of which m, cases are favourable to Aj, m cases are favourable to Ag and so on. 


The probability of occurrence of the event 4 =P [4] = + 


The probability of occurrence of the event Ay = P [A] = ~ well) 


The probability of occurrence of the event A, =P [A,,] = m 
The events being mutually exclusive and equally likely, the total number of cases 
favourable to the event Aj or A or ... A, is 
m=m +m +...+ My wil2) 
m 


P[A, + Ay...+ A,] =— 
[ 1 2 n] N 


_ m +m+...m 


usin al 

N g (3) 
ait, p 
N N N 


=P[A]+P[4]+...P [Ay] 
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7.1.4 Compound Events 


The simultaneous occurrence of two or more events is called a compound event. 


Conditional Probability 
The probability of the happening of an event A when the event B has already happened is 
called the conditional probability and is denoted by P |4] 


Similarly, P [ž] means the probability of the happening of an event B when the event A 


has already happened. 


Independent Events 


Two events are said to be independent if the probability of the happening of one does 
not depend on the happening or non-happening of the other. 


7.1.5 Theorem of Compound Probability or Multiplicative Law 
of Probability 
If A and B are two events, then 


B 
P [AB] =P [A] P =| 


Proof: Suppose there are n exhaustive, mutually exclusive and equally likely cases, out of 
which m cases are favourable to the happening of the event A. 


m 


P[A]=— .(1) 
n 
Let mout comes be favourable to the happening of the event B, Then 
P =| _™ (2) 
A m 


Now out of n exhaustive, mutually exclusive and equally likely out comes, m are 
favourable to the happening of A and B 


The probability of the simultaneous occurrence of A and B is 


Papja amy nM m 
n non m 
Hence, P [AB] =P [A] P =| using (1) and (2) 


Cor. 1: Interchanging A and B in the above result, we get 


A 
P [BA] =P [B] P |4] 
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A 
Hence, P [AB] =P [B] P z [- P [AB] =P [BA]] 
Cor. 2: If A and B are independent events, then 
P [AB] =P [A] P [B] 
Proof: We have P 5] =P [B], since A and B are independent. Hence 


P [AB] =P [A] P [B] 
Cor. 3: If Aj, A ,..., A, are n independent events, then 
P [A Ag ... An] =P [A] P [Ag]... P [And 


Cor. 4: If py, >...» Py, are the probabilities that certain events happen, then the 


probabilities of their non-happening are 1 — p,,1—- P ...,1— p,. The probability of all the 
events not happening (1 — p) l- p)... 0- py). 


Hence, the probability that at least one of these events happens 
=1-[0-A)(-P)...-P,)] 

Example 24: What is the chance of throwing a total of 5 or 11 with two dice ? 

Solution: Exhaustive number of cases = 67 =36 


Let P [A] denote the probability of getting a total of 5 and P [B] the probability of getting 
a total of 11. 


Favourable cases of getting a total of 5 are 


(1, 4); (4, 1); (2 , 3); (3, 2) 


4 
P [AJ=— 
[A] 36 
Favourable cases of getting a total of 11 are (5, 6) : (6, 5) 
2 
P [B] == 
[B] 36 


The probability of getting a total of 5 or 11 are 


PAs BPA re es = ed 
3636 36 6 


Example 25 : It is 8: 5 against a husband who is 55 years old living till he is 75 and 4 : 3 against 
his wife who is now 48, living till she is 68 Find the probability that 

(i) the couple will be alive for 20 years hence 

(ii) one at least of these will be alive for 20 years since. 


Solution: Let A and B respectively denote the events of the husband and wife living 20 


years hence. Then 5 . 5 
=—— = ,P[B]= = 
8+5 13 7 


P [A] 
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=> P[A]= = and P [B] = 


(i) Probability that the couple lives for 20 years 


5 3 15 

=P [AB] =P [A] P [B] == x= = — 

[AB] =P [A] P [B] 13 *7 9] 
ix a ; Ž e: 
(ii) Probability that none of A and B lives = P [A A] P [B] = — Xo. 


Hence, the probability that at least one of A and B lives 


228,458 
=o 1 “7791 


Example 26: A problem in statistics is given to the three students A, B and C whose chances of 


solving it are : ; : and n respectively. What is the probability that the problem is solved ? 


Solution: The probabilities that the ie is solved by the students A,B and C are 


l 
4 


respectively 


P[A]=~,P [B]== Z and P[C]= 


l 

2’ 
Now, P [A] = i m2. pct 
2’ 3 4 


The problem will be solved if at least one of then solves it. The probability that none of 


th 1 it ,i = = = 
em solves It ,1s -P [A] P [B] P [0] -1.2.2 = 
34 4 
Hence, the probability that the problem is solved 
-1-4-2 
4 4 


Similar Example: A problem in statistics in given to the three students A, B and C whose chances 


of solving it are > 7 and E respectively. What is the probability that the problem will be solved ? 


(=) Ans. 

32 

Example 27: p is the probability that a man aged x will die within a year. Find the probability 
that out of five men A, B, C, D, E each aged x, A will die during the year and the first to die. 


Solution: Since the probability that a man aged x will die in a year is p, the probability 


that the man does not die in a year = (1 — p). The probability that none of the five men 
each aged x dies in a year = (1 — př. The probability that at least one of the 5 men will die 


during the year ={l-(1- py}. It may be noted that one man may be any one of 
A, B, C, D, E. The probability that A will be the first to die = 7 . Hence, the probability 
that A will die during the year and be the first to die. 


ah ge A 
=ou (l- py} 
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Similar Example: pis the probability that a man aged x will die in a year. Find the probability 
that out of nmen Aj, Ag ,..., Ay each aged x, A, will die in a year and be the first to die. 


+ ree p)"Ans.| 

n 

Example 28: A speaks truth in 75% and B in 80% of the cases. In what percentage of cases are they 
likely to contradict each other in stating the same fact ? 


Solution: Probabilities that A and B speak the truth are given by 


pe SS pB] -20 -4 
100 4 100 5 
Probabilities that they do not speak the truth are 
pial epit 
4 4 5 5 
Probabilities that they contradict each other is 
> — 3 1 4 1 7 
=P[A]P[B]+ P[B]P [A] =>x-= + = x= =— 
[A] P [B] [BIL [A] = E T 


Hence, A and B are likely to contradict each other in = x 100 =35% of the cases. 


Example 29: Probability that a boy will pass an examination is >and that for a girl is £ . What 


is the probability that at least one of them passes ? 


Solution: The probability that the boy passes is P [A] = 2 : 


The Probability that the girl passes is P [B] = Z : 

The probability that at least one of them passes is 
=P [A+ B]=P [A] + P [B] -P [AB] 
= p[A] + P [B] -P [A] P [B] 

3 + 2 32_ 19 


Example 30: If from a bag of 6 red, 5 white and 4 black balls two balls are drawn, find probability 
that none of them is white. [B.C.A. (Meerut) 2003] 
Solution: Given 6 Red Balls, 5 White Balls, 4 Black Balls 

Total number of balls =6 + 5 +4 =15 balls 
2 balls drawn 


Find none of them is white. 
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6x5 
7 í 7 ) 6 (6 x4) 
=| 75x14 d5] 15x14 
| 2 ) £ 
15 6 24 
-105 105 105 
45 3 
“105 7 


Example 31: A bag contains 6 white and 9 black balls. Four balls are drawn at a time. Find the 
probability for the first draw to give 4 white and the second to give 4 black balls in each of the 
following cases: 


(i) The balls are not replaced before the second draw. 


(ii) The balls are replaced before the second draw. [B.C.A. (Agra) 2004] 


Solution: 
(i) | We can draw 4 balls from a bag containing 6 white and 9 black balls in er ways. 


Let A be the event that the first draw gives 4 white balls and B be the event that the 
second draw gives 4 black balls. We draw 4 white balls out of 6 white balls in en 


ways. 


Now, if the drawn balls are not replaced, we can draw the other 4 balls in 11 C4 ways. 


The event B that the second draw results in 4 black balls (on the assumption that the 
first draw has given 4 white balls) has 2 C4 favourable cases. 


A ie 
6 9 
Hence, Pjab=P apil- 
B Cy ° 715 
oC, 
(ii) We have PA sie 
4 


Since the balls are replaced after the first draw, the probability of drawing 4 black 
9 C4 
15 Cj : 


balls in the second draw is 
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P [AB] =P [A].P [B] as A and B are independent events. 


6 9 
C4 y Cy _ 3 


_— POO BEBE, 2968 


Example 32: A and B toss a coin alternately on the understanding that the first to obtain head 
wins the toss. Show that their respective chances of winning are 2/3 and 1/3. 


Solution: Probabilities that A and B get head in a toss are given by 
P[A]=P[B]=} 


n Ts 1l 
PIA]=[B]=1-5=3 


If A begins, he can win in following mutually exclusive ways 
(A), (A B A), (AB AB A),... 
Therefore, by the theorem of total probability, probability that A wins 


=P[A]+P[AB A]+P[AB AB _A]}+... 


=P [A] + P [A] P [B] P [A] + {P [A] P [B}P [A] +... 


Since total probability is unity and one of the players is to win. Probability that B wins 
l 
3 


1-2 

3 
Example 33: A husband and wife appear in an interview for two vacancies in the same post. The 
probability of the husband's selection is and that of wife's selection is ~ . What is probability that 
only one of them will be selected ? [B.C.A. (Meerut) 2009] 


Solution: The probability of the husband's selection is 


1 — l1 6 
P =- > P[H]=l-=== 
=o = [H] 7S7 
The probability of the wife's selection is 
l — 1 4 
P(W]=-=>P[W]=l-+==-+ 
[ > [Ww] 575 


Probability Cai F. 


The probability that only one of them will be selected 
=P [H] P [W]+ P [W] P [H] 
14 16 10 


1 ge eee 
75 57 35 7 


Example 34: A and B take turns in throwing two dice, the first to throw 9 being awarded the price. 
Show that their chance of winning are in the ratio 9 : 8. 


Solution: The total of 9 can be obtained as follows: 
(3, 6), (4, 5), (5, 4) and (6, 3). Total number of cases = 6 x 6 = 36. 


No. of favourable cases= 4 


Then, P [A] =P [B]= 4 1 
36 9 

— 1 8 

Therefore, P[A]=P [B Bl=l-5=5 


Since A begins, he can win following mutually exclusive ways: 
(A),(A B A),(A B AB A).... 
Therefore, by the theorem of total probability, probability that A wins 


era +P[ABABA)+... 


iG) +) 


1 

9 

ia ; 8 
which is a G.P. with C.R. = 9 
1 

9 


zA olo 
l- 64 17 
81 
Since total probability is unity and one of two players is to win. Probability that B wins 
=]-— = = Z Hence, their chances of winning are in the ratio 9 : 8. 


Example 35: Three groups of children contain respectively 3 girls and 1 boy, 2 girls and 2 boys 
and | girl and 3 boys. One child is selected at random from each group. Show that the chance that 


the three selected consist of 1 girl and 2 boys is 13/32. 
[B.C.A. (Delhi) 2005; B.C.A. (Meerut) 2002] 


Solution: | girl and 2 boys can be selected from each group in the following mutually 


exclusive ways: 


o o 
= ae ee ee 
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By the theorem of total probability. 
Required probability = P [I] + P [H] + P LT] wD 
The probability of selecting a girl from the first group is 3/4, of selecting a boy from the 


second group is 2/4, and of selecting a boy from the third group is 3/4. Since the three 
events of selecting children from three groups are independent of each other, therefore 


puj- x eS 

4 4 4 32 

Similarly, P [I= = x Z x - = 2 
Piet a2 y le 

4 4 4 32 


Substituting in (1), the required probability is 
9. 3 1 13 
=—+—+—=— 
32 32 32 32 
Example 36: A and B are two independent witness (i.e. there is no collusion between them) in a 


case. The probability that A will speak the truth is x and the probability that B will speak the truth 
is y. A and B agree in a certain statement. Show that the probability that this statement is true, is 


= y 
l-x- y+2xy 
Solution: Let A, be the event that A and B agree in a statement and Ag be the event that 
their statement is correct 
Then P [Ay] =4y + l- 4x) - y)=l-x- y+2xy 
Now P [A Ag] = xy 


By compound probability theorem, we have 
A 
P [A Ag]=P [A] P 2l 
1 
Ay |_ PLA, Ao] _ y 
Ay P [Aj] l-x- y+2xp 


Example 37: A bag contains 50 tickets numbered 1,2, ..., 50. Five tickets are drawn at random 
and arranged in ascending order of magnitude : (x, < X9 <... < X59). What is the probability that 
x =30? 


Solution: Five tickets out of 50 tickets can be drawn in °° Cs ways. Since x3 =30, 


tickets 1; and x must be drawn from the set {1, 2, ... , 29} and x4 and x5 from the set 
{31,32,...,50}. The tickets x, x2, x4 and x5 can be chosen in 29 Cy x nC ways. Hence 
29 20 
C: C: 
P [x3 =30] = —— 
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Example 38: Find the probability of throwing 10 exactly in one throw with three dice. 
[B.C.A. (Meerut) 2004] 


Solution: We can throw 10 with three dice in the following ways: 
(1,3,6) 0,4,5) (1,5,4), (1, 6, 3), 


(2,2,6), (2,3,5), (2,4,4), (2,5,3), (2,6,2), 

(3,1,6), (3,2,5), (3,3,4), (3,4,3), (3,5,2), (3,6,1) 
(4,1,5), (4,2,4), (4,3,3), (4,4,2), (4,5,1), 

(5,1,4), (5,2,3), (5,3,2), (5,4,1), 

(6,1,3), (6,2,2), (6,3,1), 


The above favourable case are m =27 
Total number of exhaustive cases is =6x6x6=216 
27 -1 
H , th ired bability = — = =. 
ence, the required probability Jc S 
Example 39: Find the probability of a total of 9 when three pair dice are tossed once. 


Solution: We get a total of 9 when three fair dice are tossed once in the following 25 


ways: 
(1,2,6), (1,3,5), (1,4, 4), (1,5, 3) (1, 6, 2), 
(2,1,6), (2,2,5), (2,3, 4), (2, 4, 3), (2,5, 2), (2, 6, 1) 
(3,1,5), (3,2,4), (8,3, 3), (3, 4, 2), (3,5, 1), 
(4,1,4), (4,2,3), (4,3, 2), (4, 4, 1), (5 1,3); 
(5,2,2), (5,3,1), (6, 1,2), (6, 2, 1) 
Hence, the required probability = — z 


Example 40: Compare the probability of at least one 6 in 4 tosses of a fair die with the probability 
of at least one double - 6 in 24 tosses of two fair dice. 


Solution: Probability that no. 6 appears in 4 tosses of a fair die is 


-(-3)l-3)t-4)t-3)-6) 


Hence, the probability of at least one 6 in 4 tosses of a fair die is 


4 
5 625 671 
sg e ee = OE S 

(?} 1296 1296 (ppro 


Probability of a double - 6 i.e. the pair (6, 6) when a pair of fair dice is thrown = — . Thus, 


the probability that no double - 6 appears when a pair of fair dice is thrown 
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24 
and this probability when there are 24 tosses of two fair dice is S) . Hence, the 


probability of at least one double - 6 in 24 tosses of two pair dices 


24 
=|- (=) =0.49 (approx) 
The ratio of the two probabilities is 52 : 49. 


Example 41: In an assembly-line operation, =o the items being produced are defective. If three 
items are picked at random and tested. What is the probability ? 
(i) That exactly one of them will be defective ? 


(ii) That at least one of them will be defective ? 


Solution: Probability of a defective item = - 
J: 2 
Probability of a non-defective item =1 — 3 = 7 


(i) Probability that exactly one of the three items will be defective 
l 22 22 22d LB 


=—x=x Ege ie A eye) ale 
3 3 3 3 3 3 3 3 3 27 9 


(ii) | Probability that none of the three items will be defective 


-2% 2 yee 8 
ae ne ~ 27 
The probability that at least one of the items will be defective 
1-8-2 
27 27 


Example 42: A bag contains 10 white and 6 black balls. 4 balls are successively drawn out and not 
replaced. What is the probability that they are alternately of different colours ? 
[B.C.A. (Meerut) 2002] 


Solution: The balls are chosen without replacement. There will be two situations first 
ball is white or first ball is black. 


(I) Beginning with White: 
The probability of drawing a white ball = o 


The probability of drawing a black ball then = 2 


The probability of drawing a white ball then = = 


The probability of drawing a black ball then = = 
10 6. 9 5 _ 45 


The probability of the compound event = — x — x — x — =—— 
16 15 14 13 728 
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GI) Beginning with Black: 


The probability of drawing a black ball = - 


The probability of drawing a white ball then = L- 
The probability of drawing a black ball then = Z 


The probability of drawing a white ball then = — 


The probability of the compound even = 2 x 2 x 2 x EAE 
16 15 14 13 728 


45 45 


— x — =0.0038 
728 728 


Therefore, the required probability = 


Example 43: In a certain college, the student engage in different sports in the following 
proportions: 
Foot ball : 60 percent of all students. 
Cricket : 50 percent of all students. 
Both Football and Cricket: 30 percent of all students. If student is selected at random. What is 
the probability that he/she will: 
(i) Play football or cricket; (ii) Play neither sport. [B.C.A. (Meerut) 2006] 
Solution: Let number of total students =100 
Played football = 60 
Played cricket = 50 
Both played = 30 
Played only football =30 
Played only cricket = 20 
Not played =100 — (30 + 30 + 20) =100 - 80 =20 


30 3 

Probability to play football = =— = — 
robability to play footba 00 10 

20 1l 

Probability to pl cricket = —— = — 

or robability to play cricke 00° 5 
Probability to either sport = a = o 


Example 44: A bag contains 7 white, 5 black and 4 red balls. Four balls are drawn without 


replacement. Find the probability that at least three balls are black. 
[B.C.A. (Rohilkhand) 2008] 


Solution: Total number of black balls = 5 


Total number of non-black balls =7+4 =11 
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Required probability = P(5 black balls) + P(4 black balls) 
+ P (3 black balls) + P(2 black balls) 


R 4 3 1) R 4 ll =) 
=|— x— x — x — | + | — x — x — x — 
16 15 14 3 16 15 14 13 


5 11 4 3 11 5.4 3 
+| >= x— x— x— |+| — x> x — x 

& 15 14 >) k 15 14 >) 
660 660 660 660 2,640 


+ + + = 
43,680 43,680 43,680 43,680 43,680 


5 4 3 2 120 
=—X—X—X—= 
16 15 14 13 43,680 


P(A black balls) 


2,64 
Hence, the required probability = 640 + 129 
43,680 43,680 
_ 2,760 23 
43,680 364 
Example 45: 
(i) Ina simultaneaus toss of four coins what is the probability of getting exactly three heads ? 
(ii) Given two mutually exclusive events A and B such that P(A) =0.45 and P(B) =0.35, 
P(A orB) 
(iii) Ina single throw of two dice, find the probability of getting a total 9 or 11. 
(iv) Two dice are thrown. Find the probability of getting an odd number on one and multiple of 
three on the other. 
(v) A bag contains 5 green and 7 red balls. Two balls are drawn. What is the probability that 
one is green and other is red ? 
(vi) A bag contains 5 red, 5 white and 4 black balls. Two balls are drawn. Find the probability 
that one of them is red. [B.C.A. (Agra) 2009] 
(vii) The probability of hitting a target by three marks man are : 5 and > respectively. Find the 
probability that one and only of them will hit the target when they fire simultaneously. 
[B.C.A. (Lucknow) 2008, 2004] 
(viii) A boxx contains 3 white balls and 2 black balls, and another bag y contains 2 white balls and 
4 black balls. A bag and a ball out of it are picked at random. What is the probability that 
ball is white. [B.C.A. (Meerut) 2005] 
(ix) A and B throw a coin alternatively till one of them gets a head and wins the game. Find the 
probability of winning. [B.C.A. (Kurukshetra) 2006] 
Solution: 
(i) We have 


S = {HHHH, HHHT, HHTH, HTHH, THHH, HATT, HTTH, TTHH, THHT, 
HTHT, THTH, HTTH, THTT, TTHT, TTTH, HITTT} 
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P(3 heads) = á 
16 


(ii) Since A and B are mutually exclusive events. 
P(A or B) = P(A) + P(B) =0.45 +0.25 =0.70 
(iii) Let A = {getting a total of 9} 
B = {getting a total of 11} 
Favourable ways for getting total of 9 are: (6, 3), (3, 6), (5, 4), (4, 5) 
Favourable ways for getting total of 11 are (6, 5), (5, 6) 


Here, A and B are mutually exclusive events. 


Poot=P ares ee" =! 
36 36 36 6 


(iv) Favourable ways are: 
(1,3), (1,6), (3,1),(3,3),(3,5),(3,6),(5,3), 
(5,6),(6,1),(6,3),(6,5) = 11 ways 
Total no. of ways = 6 x6 =36 ways 
Required probability = = 


SC, x70 _ 5x7 _35 
12C) ‘12x11 66 
2xl 


(v) Required probability = 


(vi) Total balls =6+5+4=15 
Two balls can be drawn =15 Cy) ways 


Favourable ways of getting 2 non red balls =9 C3 
9x8 
C: 
Required probability oe. a 2a) - 36 el 
I5Cy 15x14 105 35 


2xl 
(vii) Here p(ay=t P(B) =4 p(cy=t 
, “9? TI 4 
S ee ee ee 
d P(A) =—,P(B)==,P(C) == 
an (A) 5 (B) 3 (C) 7 


L-2. 3- To I 3n T2] 
=—x= x4 —K—xK= 4K x= 
23 4 2 3 4 2 3 4 
1 1 1 6+3+2 ll 
=—+-+—= =— 
4 8 12 24 2 


(viii) Probability for choosing bag X = - 
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Probability of getting white ball out of it = > 


Probability to get a white ball out of X bag 


I-33 
= — X — = — 
2 5 10 
Similarly probability to get a white ball out of Y bag 
-1,2 1 
2° 6 6 


Required probability = = + > 7 -— = — = = 


i 1 
i P =-—,P(T)== 
(ix), PUH)=5, PO) =5 
A will win if he throws head in Ist, 3rd, 5th, ..... throws. 
. A’s probability of winning 
i (iy ily A 
=>+|=| .—+]—] .-+.. 
2 \2) 2 J 2 
l 
-2-2 E S. ->] 
j-. 3 1-7 
4 
, Ha R: 2 
and B’s probability of winning = i =1/3. 
Example 46: 
(i) If E,, By,...., Eare a set of mutually exclusive events and exhaustive set of events, prove that 
the non-occurrence of E or Ey or Ez means that at least one of the remaining events 
E, Ey,..., En must have occurred. 
(ii) What is the probability that number selected from the numbers 1,2, ...,25 is a prime number. 
You may assume that each of the 25 numbers is equally likely to be selected. 
[B.C.A. (Kanpur) 2006, 2004] 
(iii) One card is drawn from a pack of 52 cards each of 52 cards being equally likely to be drawn. 
Find the probability of 
(a) The card drawn is red 
(b) The card drawn is a king 
(c) The card drawn is red and a king 
(d) The card drawn is either red or a king [B.C.A. (Kanpur) 2007] 
(iv) One card is drawn from a pack of 52 cards so that each card is equally likely to be selected. In 


which of the following cases are the events E and F are independent: 


(a) E: The card drawn is a spade 


F: The card drawn is an ace 
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(b) E: The card drawn is black 
F: The card drawn is a king 


(c) E: The card drawn is a king or a queen 


F: The card drawn is a queen or jack. [B.C.A. (Kurukshetra) 2010] 


Solution: 


(i) 


(iii) 


Since £,, E ...,E, are a set of mutuality exclusive events and exhausline set of 
events. 


S=E UE UE U....UE, 
Where S is the sample space of experiment. 
S = (Ej, U Ep U Ea) U (Ey U 5 U...-U Ep) 


Let the outcome of the experiment be denoted by x since the events Ẹ, Ey, E3 are 
given to have not occurred. 


xg E UE UE. 
Also xebh UBU U....U E, 

x e F4 U Es U....U En [e xe UE UE] 
= x belongs to at least one out of Ey, Es .... En 


Hence the non-occurrence of EF, or Ey or E means that at least one of the 
remaining events E4, E5 ... E,, must have occurred. 


Here, S423 4.025) 
Let A be the event of getting a prime number. 
A ={2,3,5,7,11,13,17,19,2 1} 


9 


Required probability i.e. P(A) = T 


(a) There are 26 red cards in a pack of 52 cards. 


26 
C 
P (a red card) ee 

CG, 32 2 


(b) There are 4 kings in a pack of 52 cards 


(c) There are two kings which are also red cards 


2 
Required Prob. = pec = 2 = 1 
52 C 52 26 
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(d) Let A = {getting a red card} 
B = {getting a king} 
Then, P(AorB) =P(A)+P(B)—-P(AandB) 
1 1 1 _13+2-1 14_ 7 


“2° 13 26 £26 26 13) 


(iv) (a) There are 13 cards of spade in a pack 


131 
P= 


There are 4 aces in the pack 


paed 
52 13 


EN F = {an ace of spade} 


1 

P (EN F)=— 

ee) 52 
1 1 1 
Also,  P(£)xP(F) =—x—=— 
4 13 52 


Since , P(E) F) = P(E) xP(F) 
Hence, the events E and F are independent events. 


(b) There are 26 black cards in the pack 


Pes = 
52 2 


There are 4 kings in the pack 


se ees 
52 13 


EN F ={2 black king} 


2.1 


PENA =z = 56 


Also, P(E)xP(F) =x xL 
2 13 26 


Since, P(ENF)=P(EXP(F) 
Hence, the events E and F are independent events. 


(c) There are 4 kings and 4 queens in the pack 


n(E) =8 


C- n(EN F) =1) 
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8 2 
P(E)= 75 


There are 4 queens and 4 Jacks in the pack 


n(F)=8 


EN F ={4 queens} 


4 l 

P (EQN F) = — = — 

EAN 52 13 
Also, Peppa! ge = 
13 13 169 


Here, P(EN F) #P(E)xP(F) 
Hence the events E and F are not independent. 


Example 47: Three persons A, B, C throw a dice in succession till one gets a ‘six’ & wins the game. 
Find their respective probabilities of winning. [B.C.A. (A.M.U.) 2011, 2009] 


Solution: The chance of getting a ‘six’ when a dice is thrown = Pa 


‘ tees PREES l 
The chance of not getting a ‘six’, when a dice is not thrown = = = 


al 


The prob. of A' ‘s’ winning 


l l 

- ae E eee ee 
5% 125 91 6 91 91 

1-(3) 216 216 


The prob. of B's winning 


5 216 _ 5x6 31 


= x—s= = 
36 9I 91 -90 
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The prob. of C's winning 


(2) xt 25 25 
6 6 _ 36x6 _ 216 
\6) "6 _ 


5p y 225 91 

1-(3} 216 216 
-23 
9I 


Example 48: A bag contains 3 red and 5 black balls and a second bag contains 6 black balls and 4 
red balls. A ball is drawn from each bag. Find probability that one is red and the other is black. 
[B.C.A. (Kashi Vidhyapeeth) 2011, 2007] 


Solution: Total no. of balls in the Ist bag =3 +5 =8 


P (getting red ball) = 


oln œl% 


and P (getting black ball) = 


Total no. of balls in 2nd bag =6+4=10 


P (getting black ball) = 2 = 


P(getting red ball) = 
Required probability = P (red ball from Ist bag) xP (black ball from 2nd bag) 


+ P (red from 2nd bag) x P (black from Ist bag) 


3. 9 2 3 L9 
= x +x 2s 
8 5 5 8 40 
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À Problem Set ` 


If two balls are drawn from a bag containing 2 white, 4 red and 5 black balls. What 
is the chance that (i) both the balls are red (ii) one is red and the other black ? 


4C 6 4x5 ‘| 


=>; (ii) 4-4 
Il ll 
Cy 55 G 1 


| Hint: (i) 


A box contains 6 red, 4 white and 5 black balls. A person draws 4 balls from the box 
at random. Find the probability that among the balls drawn. There is at least one 
ball of each colour. 


| Hint: The required probability is 


ecx Oe aa 7 ec tal Oe aa ON . sox OP ans -4 


What is the chance that a non-leap year selected at random will contain 53 
Sunday? 


| Hint: In a non-leap year there are 52 weeks and 1 day (which can be any day of 


the week). There will be 53 Sundays if the remaining one day in Sunday whose 


probability is | : 


From a pack of 52 cards, two are drawn at random. Find the chance that one is a 
king and the other a queen. 


Hine; POS 8 
` 52 663 | 
Cy 
The chance of winning of two race horses are 1/3 and 1/6 respectively. What is the 
probability that at least one will win when the horses are running (i) in different 
races and (ii) in the same race ? 
| Hint: (i) Reqd. Prob.=1- (i — z) (! - i) -2 (ii) Reqd. Prob. =i + > - J 
Three ships A, B and C sail from England to India. Odds in favor of their arriving 
safely are 2 : 5, 3 : 7 and 6 : 11 respectively. Find the chance that they all arrive 
safely. 


Hint: (i) Reqd. Prob. = Z x 2 x i = ELS 

7 10 17 595 
From a group of 8 children, 5 boys and 3 girls. Three children are selected at random. 
Calculate the probabilities that the selected group contains (i) no girl, (ii) only one 


girl, (iii) one particular girl, (iv) at least one girl and v more girls than boys. 
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30 x5G 30 
(v) =a. E= 
C3 C3 
8. An urn contains 3 red and 4 black balls. Two balls are taken out at random. Find 


the probabilities that the ball are of (i) different colours (ii) black colour, (iii) red 
colour. 


2G x 4C 


| Hint: (i) Fe 
2 


4 3 
-4 (ii) 2 2 Gj a2 
7 en O 7 


9: Find the probability of throwing 15 with three unbiased dice. 
[Hint: We can throw 15 in the following ways: 
(6, 6,3), (6,3,6), (3,6,6), (6,5,4), (6,4, 5), 
(5,4, 6), (5,6,4), (4,5,6), (4,6,5), (5,5, 5) 
m=10,n=6x6x6=216] 
10. A person throw 3 coins. Find the probability that at least one lead turns up. 
11. Find the chance of throwing more than 14 with three symmetrical dice. 
[Hint: Reqd. Prob. = P [15] + P [16] + P [17] + P [18] ] 
12. A bag contains 4 red and 3 blue balls. Two drawings of 2 balls are made. Find the 
chance that the first drawing gives 2 red balls and the second drawing two blue 


balls (i) if the balls are returned to the bag after the first draw, (ii) If the balls are 
not returned 


[4o AE (46 \(3e 
| Hine) ka (e) a ka 3l 


Answers ` 
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7.2 Set Theoretic Concepts 


Let x be any universal set (e.g. X =R. set of all real numbers). Let A and B be any two 
subsets of X. Then 


l: 


(A) 


(B) 


AwB={4{x:xeA orxeBor x e both A and B} 
is called union of A and B. 

AB or AB ={x:x eA and x eB} 

is called the intersection of A and B. 
A-Bz={x:xeAbut x ¢ B} 

is called the difference of A and B. 

A or X-A={x:xe X but x ¢ A} 

is called the complement of A w.r.t. x 


The sets A and B are said to be mutually exclusive or disjoint if 
AB = 6 or AB = 6, where > denotes the empty set. Further, a sequence (A;) of 
subsets of X is said to be mutually exclusive if A; A; = 6 for each i # j. 


Sample Space [B.C.A. (Meerut) 2010] 


Suppose we toss a coin. The possible outcomes of this experiment are head (H) and 
tail (T). The collection of those outcomes is written as 


A={H,T} 


Suppose we toss two coins at a time. The collection of all possible outcomes of this 


experiment is 


B ={HH, HT,TH,TT} 


If we toss three coins at a time, the collection of all possible outcomes of this 
experiment is 


C ={HHH, HAT, HTH, THA, HTT,THT,TTH, TTT} 


The collection A, B,C are called sample spaces of the experiments of tossing a 
coin, two coins and three coins : respectively. We call each of the above 
experiments as random experiment in as much as its outcomes is uncertain and a 
given outcome is just one of many possible outcomes. 


The Sample Space is the collection of all possible outcomes of a random 
experiment. 


Probability Space: If Q is a sample space. A a o algebra of events (each a subset of 
Q) and P [.] a probability function with domain A. Then the triplet (Q, A, P [.]) is 
called a probability space. 
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Remarks: 
(i) Po] =0 


(ii) P [A] U Ay U.U A=) P [A;] 
i=l 


where Aj, Ag ,..., A, are mutually exclusive events in A 
(iii) For any event A in A.P [A] =1-P [A] 
(iv) For any two events A and B in A. 
P [A U B] =P [A] + P [B] -P [AB] 
(v) IfA and B are in A and A cB. Then P [A] <P [B]. 
Theorem: If A and B are mutually disjoint events, then 
P [A u B]=P [A] + P [B] 


Proof: Since A and B are mutually disjoint events, therefore 
AB =o =P [AB] =P [6] =0 


Result follows by part (iv) now. 


Theorem: (Bon-Ferroni's inequality) 
If Aj, Ag ,..., Ay E€ A. Then 
P [Aj Ap ... Ay] 2 P [Ay] + P [Ao] +... + P [A,] - (2-1) 


or 
ap) apes P(A) - (n-1) 
i=l i=l 


Example 49: If P [A] = 5 and P [B] = n can A and B be disjoint ? Explain. 


Solution: Suppose A and B are disjoint events. Then 
ANB=o>AcCB=P[A|<P[B] >-< 

which is impossible. Hence A and B cannot be disjoint. 

Example 50: A and B are disjoint events. P [A] =.5,P [AU B] =.6. What is P [B]? 

Solution: We know P [A U B] =P [A] + P [B] - P [AB] 


Since A and B are disjoint AB = 6 = [AB] =P [0] =0 


Hence, P[B])=P[AUB]-P[A]=6-.5=01 
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Example 51: If B c A, then 
(i) P[AnB]=P[A]-P [B] 
(ii) P [B]<P [A] 
Solution: 
(i) Since B c A. So B and A AB are mutually disjoint events and 
A=BU(AAB). 
> P[A]=P[BU(AnNB)]=P[B]+P[AnB] (1) 
Hence, P[ANB|=P[A|-P [B] 
(ii) We know P [A n B] 20 =P [A] -P [B] > by (1) 
Hence, P [B] <P [A] 
Corollary. Since AM B c A and A A B CB, therefore, 
P[A^aB]<P [A] and P [A ^A B]<P [B] 
Example 52: If A,B and C are any three events, then 


P[AUBUC]=P[A]+P[B]+P[C]-P[AnB] 
-P[BAC]-P[COA]+P[ANBOC] 


Solution: We have P[A UB UC]=P [(A UB) UC] 


=P[AUB]+P[C]-P[AUB) aC], Ck) 
Now, PLAVB)AC]=P (ANC) U(BAC)], by distributive law 

=P[ANC]+P[BAC]-P[ANC)A(BAC)], 

=P[ANC]+P[BNC]-P[ANBonC] (2) 
Also P{[AUB]=P[A]+P [B]-P[AnB] (3) 


Using (2) and (3) in (1) we get the required result. 


Example 53: Three newspapers A, B, C are published in a city and a survey of readers indicates 
the following: 

20% read A, 16% read B, 14% read C 

8% read both A and B, 5% read both A and C 

4% read both B and C, 2% read all the three. 


For a person chosen at random, find the probability that he reads none of the paper. 


SURRY Wehave PA] =20%=2"., Ppp -16, 
100 100 


Pigs. 
0 100 


PUA es Page 
100 100 
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PIRAC=— = PUABACHS = 
100 100 


The probability that the person reads A or B or C is given by 
P[AVUBUC]=P [A] +P [B]+P[C]-P[AnB]-P[BaC] 
-P[CNA]+P[ANBaC] 


20, 16,14 8 5 42) 
~100 100 100 100 100 100 100 
ee 


~ 100 20 


Hence, the probability that he reads none of the papers 


=1=P obut ei 
20 20 


Example 54: If P [A] =0.9, P [B] =0.8, show that P [AB] 20.7. 


Solution: By Bon-ferroni inequality, we have 
P [AB] 2> P [A] + P [B]-1 
20.9+0.8-1=0.7 
Hence, P [AB] 20.7 


Example 55: Event A and B are such that P [A] = and P [B] = > show that 


(i) P[AUB]>Ž 


3 5 
i) 2<P[AB]<Ż 
(ii) g TABS 
1 = 3 
iii) —<P[AB]<= 
(ii) <P [AB] <= 
Solution: 
(i) SinceACAUB,P [A] <P [AUB] 
Thus, + <P [AUB] =P [AUB] 22 


(ii) Since AB=ANBCB=>P{[AB|<P [B] -> 
Thus, P [AB] < >, by Bon-ferroni's inequality 


P [AB] > P [A] + P [B] -1= 2-152 


Aw 


= <P [AB] < 


œu 
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(iii) Again P [AB] >P [A] + P [B] 1-24 (1 z) I= 
Since AB = AB cB.P [AB] <P (B]=1-2 == 

de ppaB <2 

8 8 


Example 56: Events A and B are such that 


3 l oo 
P[AUB]=2,P[AB]=—and P [A] == 
[Au] [ ]=7%® [A]=3 


Find P |B] and P [AB] 


Solution: 
We know P [A U B] =P [A] + P [B] - P [AB] 

3 2 I 2 
= +-(1 Z) +P BI gor les 
Now, P [AB] =P [AN B]=P [A] -P [AB], 

p(aB)=|1-2|-2 = 
3] 4 12 


Example 57: Ina very hotly fought battle, the probability of a combatant losing an eye was 70%, 
an ear 75%, an arm 80% and a leg 85%. Show that the probability of a combatant losing all the 
four was not less than 10%. 


Solution: Let Aj, A, A and A4 denote the events of the combatant losing an eye, an 
arm, an ear, and a leg respectively. Then 
P [A] =0.70, P [Ay] =0.75, P [Ag] =0.80, P [Ay] =0.85 

By Bon-ferroni's inequality 

P [A Ay A3 Aq] 2 P [A] + P [Ag] + P [Ag] + P [Ag] - (4 -1 

20.70 + 0.75 +0.80 + 0.85 -3 =0.10 

P [A A Ag Ay] 20.10 
> P [A, A Az Aq] £0.10 (10%) 
Example 58: A purse contains 2 silver and 4 copper coins. A second purse contains 4 silver and 3 


copper coins. If a coin is pulled out at random from one of the two purses, what is the probability that 


it is a silver coin ? 
[B.C.A. (B.H.U.) 2010] 


Solution: Probability of choosing first purse = 2 


w| = 


Probability to getting a silver coin out of it = a = 
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Probability to get a silver coin and first purse 


1 1 1l 

= — X — = — 
2 3 6 

Similarly probability to get a silver coin and IInd purse 

1 4 2 

= — X — = — 
2 7 7 

Required probability = 1l + 2 Eae 13. 

6 7 42 42 


Example 59: One bag contains five white and four black balls. Another bag contains seven white 
and nine black balls. A ball is transferred from the first bag to second bag and then a ball is drawn 

from the second. Find the probability that the ball drawn is white. 
[B.C.A. (Kashi Vidhyapeeth) 2009] 


Solution: The ball transferred from the first bag to second bag can either be white or 
black. 


Case I: When white ball is transferred from first bag to IInd bag. Probability of getting 


white ball from Ist bag. 5 


“6+9 


Now second bag contains 8 white and 9 black balls. The probability of drawing a white 
ball from the IInd bag 


= 8 pee, 
(8+9) 17 
The probability of both these events taking place together is 
5 8 40 
— X — = —, 
9 17 153 


Case II: When a black ball is transferred from Ist bag to IInd bag. 


Probability of getting black ball from Ist bag 
ek: a 
6+4) 9° 


Now, second bag contains 7 white and 10 black balls. The probability of drawing a white 
ball from second bag 

o 7 _7 
~ (7410) 17` 


The probability of both these events taking place together 
4 7 28 


9°17 153 

Since the above two case are mutual exclusive, therefore only of them can happen. Hence 
the required probability 

„do 28 _ 68 

153 153- 153 


Es 
` 
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Example 60: An almirah stores 5 black and 4 white balls. A boy pulls out 2 balls at random. 
What is the probability of the same colour ? [B.C.A. (Meerut) 2002] 


Solution: Total no. of balls =5 +4 =9 


Total no. of ways choosing 2 balls out of 9 balls 


gan Boo 
N(S) = oan 


No. of favourable events N(E) = 18 + i =16 


P(Getting 2 balls of same colour) = NUE) 7 a = E 
N(S) 36 9 


Example 61: A man has 3 children with atleast one boy. What is the probability that he has 2 
boys and one girl. [B.Ed. Entrance Exam.-2002] 


Solution: Total no. of events N(S) 


=3G, x7 +3, x3G =3x3 +3 x3 =18 


Favourable no. of events N (E) =3C, x 3C =3x3=9 


P (getting 2 boys and one girl) = as 
=> pee ai 
18 2 


Example 62: There are six different letters and 6 correspondingly addressed envelops. If the letters 
are randomly put in the envelopes, what is the probability that exactly 5 letters go into the correctly 


addressed envelopes ? 


Solution: Probability of r particular letters out of n letters in the right envelopes 


n-r 


ln 


Probability that exactly 5 letters go into the correctly addressed envelopes 


_ (6-5 _| 
6 


[6 


me 
720 


=0.0013 =0 


[A 
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7.2.1 Computation of the Probability of an Event by the Axiomatic 
Approach 


Consider the probability space S where the sample space. S is finite. Let S contain N 
sample points W, W ,..., Wy. Suppose that these sample points are equally likely i.e. 


P KWI} =P {W2} =... = P HWN} wl) 
Then the probability of any event A is given by 
N (A) 
P [A] =—— (2) 
[A] NS) (2) 


Here N (A) and N (S) denote the number of points in A and S respectively. 


The probability function P [. ]. Satisfying conditions (1) and (2) above is said to be an 
equally likely probability function. 


Example 63: Find the probability of getting (i) a total of 7 and (ii) a total of 9 when two dice are 
thrown together. 


Solution: The sample space is 
S={i,j):lsis6,l<j <6}, N (S) =36 

It is easy to verify that all sample points are equally likely, as the probability of each 
sample point is 1/36. 
Let Az = event that the total is 7. 

Ag = event that the total is 9. 

Az ={(1, 6), (2,5), (3,4), (4,3), (5,2), (6,1) 

Ag = {(3, 6), (4,5), (5,4), (6, 3)} 


We have N (A7) =6, N (Ag) =4 
Hence, P [47] = N A) = ee = L 
N(S) 36 6 
-NM 4 _1 
Pig- N(S) 36 9 


Example 64: One urn contains three red balls, two white balls and one blue ball, A second urn 
contains one red ball, two white balls and three blue balls, one ball is selected at random from each 


urn. 
(i) Describe the sample space for this experiment. 
(ii) Ifthe balls in the two urns are mixed in a single urn and then a sample of three is drawn. Find the 


probability that all the three colours are represented when sampling is done (a) with replacement, 
(b) without replacement. 


Solution: Urn I ={7, %, 7, W, w, by, Urn H = {R}; M , W , Bi „Ba , Bg}. Here r;, R denote 


red colour; b;,B; blue colour and w;, W; white colour. 


D 
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(i) | As each urn contains 6 balls, the sample space of the experiment consists of 
6 x 6 = 36 points; 

S ={AR, nR; wW, m Wo, w W, w W3; DB), bBo, bB3;...} 

(ii) Ifthe balls in the two urns are mixed in a single urn, then the latter urn contains 12 
balls out of which 4 balls are red, 4 white and 4 blue. Draw a sample of 3 balls, 
where the sampling is done with replacement. The probability of the first ball (of 
any colour) is 12/12. Now we have 8 balls of the remaining two colours and the 
probability of the second ball (of the second colour) is 8/12. Lastly, the probability 
of the third ball (of the third colour) is 4/12. Hence, the probability that a sample 
of 3 balls contains all the three colours is 

12 8 4 2 
BRI 
When the sampling is done without replacement the probability that a sample of 


three balls contains all the three colour is 


tax C xia _4x4x4x3x2 16 


re 12x11x10 55 


Example 65: An urn contains five balls numbered 1 to 5 of which the first three are black and the 
last two are gold. A sample of size 2 is drawn with replacement. Let B, denote the event that the first 
ball drawn is black and By denote the event that the second ball drawn is black. 


(i) Describe a sample space for the experiment and exhibit the events B,,By and B,Bo. 
(ii) Find P [B,], P [By] and P [B,By | 


(iii) Repeat parts (i) and (ii) for sampling without replacement. 
Solution: Urn = {h , b, by, g4, g5} (b = black, g = gold) 


(i) | The sample space consists of 5 x 5 =25 points viz. 
S = {hy hb, bbs, by ga, gs» baby, bob, babs, by ga, b285 3-3 
85b 8549, 8505, 8584, 8585} 
(ii) The events B,, By and B; By are described below: 
By = {hh hb, bils , b84, hgs, yh, bob, bobs , bo 84, ngs, 
bish , b3by , b3b3 , b3 84,0385} 
By = {hh , hb, hbs, boh, bab», by by, bzh, bsb, bbs, 
£4, 84b2, 8403, 85b, &5b2, g5b3} 
BiB = By A By = {hh hib, bibs, bh, babs, by by, bzh , bby , bzbs } 
15.23 15 


3 9 
P [B]=33 553? [B2] zs ~ 5? eal =5e 


(iii) In sampling without replacement, the sample space S consists of 5 x 4 = 20 points. 
We obtain these 20 points by excluding the points hh, byl), b3bh3, g4 g4 and 
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g5 g5 from S. The events B) and By now (consist of 12 points each (exclude 
bb, by by and bshs). Similarly Bj By consists of 6 points. 


12 3 6 3 
H A P B =—~=-=Pp B dP BB = — = — 
ence [B] 20 5 [Bp] and P [BiB] 20 10 


Example 66: Find the probability that a certain 13-card hand contains exactly 8 spades. 


Solution: Total number of cards is M =52 


Total number of spades =13 


We can regard a 13-card hand as a sample of size 13 drawn without replacement from the 52 
cards. Let Ag denote the event of drawing exactly 8 spades in a 13-card hand. We can draw 


52 13 
13 cards from a pack of 52 cards in E ways. 8 spades out of 13 can be selected in l 8g | 


ways and the remaining 13-8 cards out of a total of 52-13 can be selected in 
52-13 39 
= ways. 
13-8 5 
13) (39 52 
Hence, P (Ag) = + 
8j{5 13 


7.3 Conditional Probability 


Let A and B be two events in a sample space (S). The conditional probability of event A 


given event B, denoted by P [2] is defined by 


„if P [B]>0 


rb 
B] P[B] 


e 


AJ P{A] 


Similarly, A 


„if P [A] >0 


is the conditional probability of event B given event A. From the above relations, we see 
that 


P [AB] = P [A/B] P [B] =P =| P [A] 
where P [A] >0 and P [B] >0. 
Example 67: Given P [A] =0.5 and P [A U B] =0.6 Find P [B] if: 


(i) A and B are mutually exclusive. 


(ii) A and B are independent. 


(iii) P [Z] =04 
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Solution: We know P [A u B] =P [A] + P [B] -P [AB] (1) 
(i) IfA and B are mutually exclusive AB = ọ 
P[AUB]=P [A] + P [B] -P [6], where P [6] =0 
Hence , P [B] =P [AUB] -P [A] =0.6 -0.5 =0.1 
(ii) If A and B are independent, then 
P [AB] =P [A] P [B] 
Putting in (1), we get 
p[Av B] =P [A] + P [B] -P [A] P [B] 


or P [A U B] =P [A] + P [B] (l -P [A]) 
or 0.6 =0.5 + P [B] (1-0.5) 
Hence, P [B] == 
(iii) We know P [A/B] = ZIA or P [AB] =0.4 P [B] 


Putting in (1), we get 
0.6 =0.5 + P [B] -0.4 P [B]=0.5 +0.6 P [B] 


Hence, P [B] = 


Example 68: If P [B]=P |5] =P |£] = > what is P [ABC] ? 


Solution: By multiplication rule, we have 


B C 
P [ABC] =P [A] P |= | P | — (1 
[ ] [A] [2] £] (1) 
By the definition of conditional probability we have 
A B 
P[AB)=P|}—|P [B])=P|—|P[A “(2 
[AB] =| [B] [z] [A] (2) 
From (1) and (2), we have 
piabc)=P|2 |r mr] ]=2.5.5=2 
B AB] 222 8 


Example 69: A single die is tossed; then ncoins are tossed, where nis the number shown on the die. 
What is the probability of exactly two heads ? 


Solution: Let B, By ... Bg be six mutually disjoint events, where B j: j appears on the die. 
1< j <6. Clearly P[Bjl=7,1<j<6. 

6 
Let B= U 
j= 


s] 


l ] 
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Let A denote the event when the head appears exactly twice, we have to find P [A]. By the 
theorem of total probability, we have 
6 
1 A 
= — P — 
1$ elé 


E A 
P[A] =}, pine 
j=l J] ‘= 


Note that if face 1 appears on the die we can not get two heads and so P 4] =0. 


I 20). PCy. 8G. "Ce, 8G 
Thus, P[A]==]|0 + —=2 + 24 2 H I IH I2 
ji a d 2 3o 2 P 2 
l 3 6 10 15 
=—|l+>+-+— + 
24 2 4 8 l6 
il 16+244+24+20415 _ 99 
~ 24 16 ~ 384 


Example 70: Five percent of the people have high blood pressure, of the people with high blood 
pressure, 75 percent drink alcohol; whereas only 50 percent of the people without high blood pressure 
drink alcohol. What percent of the drinkers have high blood pressure ? 


Solution: Let A denote the event that people have high blood pressure and B denote the 
people who drink alcohol. 


We have P{aj=0.05,P]4|=0.75,P|5|=0.5 
A A 
We have to find [sj (1) 
A P [B] 
We know B = AB U AB and AB n AB =ọ 
. P [B] =P [AB] + P [AB] =P [A ^a B] + P [B ^n A] (2) 
Now p|4)=0.75 
A 
= PIBOA) _9 75 
P [A] 
> P [A nB] =0.75 x0.05 =0.0375 i) 
Also p||-0.75 
A 
> EIB OAL 5g 
P [A] 
=> P IBN Al=5(1-P (Ap 
> P[B]-P[A ^ B]=5 (1-0.05) = (0.95) =0.475, by (2) 


P [B] =0.0375 +0.475 =0.5125, by (3) 
4] 0.0375 375 


B| 05125 5125 


Hence, by (1), 3 


=0 073 


Hence, the required percentage is 7.3%. 
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7.4 Baye's Theorem 


Let By, By ... B,, be a collection of mutually disjoint events in T of the given probability 
space (S). Such that 


n 
S=U B; and P [Bj] >0 for j =1,2,...,n 
jal 


Then for each A e T satisfying P [A] >0, we have 


A 

P |— |P [B 
p|2x]- A as 
A n 


5P 


j=l 


A 
5 P [B] 


[This is known as Baye's formula] 


Proof: By the definition of conditional probability, we have 
.| P[B,A 
r |2]- LB; | C- P [A] >0) (1) 
A P [A] 
. | A | _ P [AB] 
B] P [By] 


and 


From (1) and (2), we obtain 
A 


P A P [By] 
p EI __ lB] 
A P [A] 


By the theorem of total probabilities, we have 
n 
A 


P[A]= J, P s 


— |P [B;] , for each A eT 
j=l 


] 


Substituting in (3), Baye's formula is proved. 


The following particular case of Baye's formula is quite useful. 
Corollary: If A,B e T where P [A] >0 and O <P [B] <1, then 
A 
B P Fal [B] 
” Al al Als pox. slAn 
P BG [B] + P pr [B] 
B B 
Proof: By the definition of conditional probability, we have 
|=" [BA] Flee [AB] 


A| P[A]’ |B] PIB] 
A 
p(£]p [B] 
p2- AA) 
A P [A] 
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By the corollary of theorem, we have 


a 
B 


P [B] P A P [B] (5) 
Jeo ls | 


P{Aj=P| > 


The result follows from (4) and (5). 
Remark: If an event E can occur in combination with two mutually exclusive events 
E and Ey. Then 
E 
P [E] P| = 
sef] 


Ey = —— rs. 519 


|] 
E E E |’ 
Pie | EP) 2 =| 


Multiplication Rule 


Let Aj, Ag ,..., A, be arbitrary events in T of the given probability space (S) such that 
P [A Ag ... A, 1] > then 


A A A 
P [4AA ... A] =P [4A] P|] P| < l... Pp} ——+——_ 
[ 1442 al L J] 2l 2] atc 


Example 71: The probability that a person can hit a target is 3/5 and the probability that another 
person can hit the same target is 2/5. But the first person can fire 8 shots in the time. The second 
person fires 10 shots. They fire together. What is the probability that the second person shoots the 
target ? 


Solution: Let E denote the event of shooting the target, E and Ey respectively denote 


the events that the first person and the second person shoot the target we are given 


r|£]-2 nar] = |-2 
EB} 5 Ey | 5 


The ratio of the shots of the first person to those of the second person in the same time is 


È = £, Thus, P [Ę] = £ P [E ]. By Baye's theorem, we get 


PIE]. 


4 3 
zP 5]. z tP ial. 


aN 


eee E 


Ge” 
5 
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Example 72: Urn Acontains 2 white and 2 black balls. Urn B contains 3 white and 2 black balls. 
One ball is transferred from A to B and then one ball is drawn out of B. Find the chance that this 
ball is white. If this ball turns out to be white, find the probability that the transferred ball was 
white. 


Solution: Let Ẹ& and E denote the events that the transferred ball from A to B is white 


and black, respectively. Let W denote the event that a white balls is drawn from B. Then 


2 2 |w] 4 [w] 3 
apria Ee a] 
pwp P| 2 |« rer] a(l) 
1 2 
24 22 7 
=—.-+-—.-=— 
496 46 12 


Example 73: An urn contains a white balls and b black balls, another contains c white and d black 
balls. One ball is transferred from the first urn into the second and then a ball is drawn from the 
latter. What is the probability that it will be a white ball ? 


Solution: We know that 


a b 
PJE = zP = 
[Ai] a+b [4] a+b 
p| |- FF. pW j|-—< 
EĘ&]| (c+)+d E | c+(d+)) 
pwr E] ree E] 
E Ey 
_ a c+ E b c 
ath ct+td+l ath c+d+l 
Hence P [w] ac+bc+a 


“(at b(c+d+) 


Example 74: Three urns Aj, Ag and Ag contain respectively 3 red, 4 white, 1 blue, 1 red, 2 


white, 3 blue, 4 red, 3 white, 2 blue balls, one urn is chosen at random and a ball is withdrawn, it is 
found to be red. Find the probability that it came from urn Ag. 


Solution: If A; denotes i urn chosen and R denotes the event of withdrawing the red 


ball, then l 
P [A] =P [A] =P [A] 7 
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l 
E 1 216 12 


Example 75: Urns A, B,C have the following coloured balls: 


A: 6 red, 4 white, B : 2 red, 6 white, C : 1 red, 8 white. An urn is chosen at random; a ball drawn 
turns out to be red. Find the chance that urn A is chosen. 


| Hine P[R]=P [A] [$] Pw Face (C) P [Z] 


Té 2 | 173 
=—|/—+—=4+-—/=—— 
310 8 9] 540 


By Baye's theorem 


R 
vja Eg e 
R]  P[R] (3'10) 540 173 


Example 76: There are five urns and they are numbered 1 to 5. Each urn contains 10 balls urn ihas 
idefective balls and 10 — inon-defective balls (i = 1, 2,3,4, 5). An urn is selected at random and then a 
ball is selected at random from the selected urn. What is the probability that a defective ball will be 
selected? If the ball is defective, what is the probability that it come from urn 5 ? 


Solution: Let E denote the event that a defective ball is selected and Æ; the event that 
urn iis selected; i=1,2,3,4,5, we have 


Pee P E oo 7503.45 
5 E| 10 


By the theorem of total probabilities, the probability that a defective ball is selected is 
given by 


E 
P[E] -Y _P[E]P žl 
i=l E; 


L 


5 ! 
-> ee or era ae 
45°10 50 10 
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(ii) The probability that the defective ball came from urn 5 is given by 
e2]: P [E] Es] P [Es] 


3 

> PIE] E P[E] 
i=] 

OF. 1 
210 5.) 

3 8 

10 


Note: That the urn 5 has 5 defective balls out of a total of 10. 


Example 77: An urn contains 10 balls of which 3 are black and 7 are white. The following game is 
played. At each trial a ball is selected at random its colour is noted and it is replaced by two 
additional balls of the same colour. What is the probability that black ball is selected in each of the 
first three trials ? 


Solution: Let B; denote the event that a black ball is selected in the i® trial. We have to 
find P [B, By Bs] by the multiplication rule, we have 


P Bh By Ba) =P | P| | | B l 
1 


B,, By 
3 3+2 3+2+2 
= — xX * | — 
10° (10 +2 10+24+2 
ee ee 
“10°12 14 16 


Similar Example: In the above example, find the probability that a white ball is selected in each of 
the first three trials. 
10 12 14 


Example 78: A die is thrown as long as necessary for a 6 to turn up. Given that 6 does not turn up 
at the first throw. What is the probability that more than four throws will be necessary ? 


Solution: Let A; be the event that 6 does not turn up at the ith throw where i=1, 2,3, 4. 
Then 


PA) =2 


Notice that more than four throws are needed given A; means that in the first four throws 
6 does not turn up. 


The required probability is P using multiplication rule, we have 


|2 Ay A3 “| 
A 
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P [A] P Pau Ap — 
P 4 Ay A3 A4 | 7 A A, Ap Aj Ay A3 


A P [A] 
a e 
| Aj Ag A, Ag As 
> ..5: 9° 125 
=L x2 x> = 
6 6 6 216 


Example 79: An insurance company insured 2000 scooter drivers, 4000 car drivers and 6000 
truck drivers. The probability of any accident involving a scooter, car and a truck is 0.01, 0.03 and 


0.15 respectively. One of the insured persons meets with an accident. What is the probability that he 
is a scooter driver ? 


Solution: Let £, Fy and E; denote the events that a scooter driver and car driver, truck 
driver is chosen respectively. Then 


2000 1l 4000 6000 


PLE = g LB! =ip000 73°? [B= aon = 


l 
12000 -3 2000 2 


Let A denote the event the any of the drivers meets with an accident. The probability 
that a scooter driver meets with an accident is 


P A =0.001. Similarly, P A =0.03;P A =0.15 
B E Es 


The probability that the insured person is a scooter driver is given by 


A 
P[|Ę&]P|— 
pfä]- [E] 4l 
A A A A 
TOTEAN 


1 
z©0D 


I I 
z (0.01) + 5 (0.03) + (0.15) 


E 0.01 0.01 1 
~ 0.01+2 (0.03) +3 (0.15) 0.52 52 


Example 80: Three boxes contain galvanometers, some of which are detectives. The proportions of 
defective in boxes A,B and C are respectively 1/3, 2/9 and 1/6. A box is selected at random and a 
galvanometer drawn from it. If the selected galvanometer is found to be defective. What is the 
probability that the box B was selected ? 


Hint: Required probability is = 


Probability 515 


Example 81: In a bolt factory machines A, B and C manufacture respectively 25, 35 and 40% 


of the total. Out of their output 5, 4 and 2 Percent are defective bolts. A bolt is drawn from the 
produce and is found defective. What are the probabilities that is was manufactured by A, B and C ? 


Solution: Let E be the event that the bolt is defective and E,, Ey, E3 denote the events 


that the bolt is produced by A, B, C respectively. 


We have P [E,] =025, P [Ey] =035, P [E3] =040 
P = =005,P is =0 04 and P E =0 02 
B Ez Es 


It is required to find P Fa P Fa and P [>] 
E E E 


By Baye's theorem, we have 
E 
(0.25) (0.05) 


~ (0.25) (0.05) + (0.35) 0.04) + (0.4) (0.02) 


E 
P [EĘ]P =] 


E E E 
P [BIP Be [Ey | P Be P [Egz] P B 


A] _125 25 

E| 345 69 

Similarly, 

E 0.35 x0.04 140 28 
E | 0.25 x0.05 x0.35 x0.04+0.40x0.02 345 69 


and p|2|-1- pZ] e2] E 
E E E 69 69 69 


Example 82: The contents of urns I, II and III are as follow: 
I white, 2 red and 3 black balls, 

2 white, 3 red and I black ball, and 

3 white, 1 red and 2 black balls. 


One urn is chosen at random and two balls drawn. They happen to be white and red. What is the 
probability that they come from urns I, II or III ? 


Solution: Let —,, E and E denote the events that the urns I, II and III are chosen, 


respectively and let E be the event that the two balls taken from the selected urn are 


white and red, then 


P [ER] =P [Ey] =P [E] -5 


and 


Similarly, 


and 
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1 2 2 3 
lz C x 2, p|£)- C x Ci 6 


sœ 15 


Probability 315 


\ Problem Set N 


A coin is tossed 4 times. Define a sample space T which describes this experiment 
and give the subsets of T which correspond to the following events : 


(i) |More heads than tails are obtained. 

(ii) | Tails occur on the even numbered tosses. 

(iii) Head occur on the third toss. 

If P [A] =0.9, P [B] =08, show that P [AB] = 0.7. 

[Hint: P [AB] > P [A] + P [B] - 1 (Bon-ferroni inequality) ]. 

Arrange the following in increasing order of magnitude : 
P[A],P [AU B],P [A a B], P [A] + P [B] 


For disjoint events A,B,C are the following permissible assignments of 
probabilities ? 

P [A] -2 p (B)=4,P [C] -1 

3 4 6 


which is impossible. 
[Hint: P[A UB UC]=P [A]+P[B]+P[C]-P [An B]-P[BaAC] 


-P[ANC]+P[ANBaC] 
2,1. 1 33 


=P [A]+P[B]+P =—+—+—-—=— >] 
[A] + P [B] + P [C] sate pa 
Given P [A] =3,P [B]=3,P [A UB] =i find P| and P |Z] 

3 4 12 B A 


| Hint: P [A U B] =P [A] + P [B] - P [AB] = P [AB] -z| 
P [A] P [B 
Now, use P [AB] = ca =P [B|P |] : 
A E 
An urn contains four tickets marked with numbers 112, 121, 211, 222 and one 


ticket is drawn at random. Let A; (i=1,2,3) be the event that its digit of the number 
of the ticket drawn is 1. Discuss the independence of the events Aj, Ay and As. 


2 1 l 
Hint: P [A)]=—=—,P [Ao] =P [Ag] == 
| Hin [A= 75>" el (4s)=5| 
A, A Ag is the event that the first two digits in the number which the drawn ticket 


bears is each 1. 


l 
P [A n AQ] =7=—5.5=P [A]P [Ay] = 4,4 


are independent. Similarly Ay, A3 , Ag, Aj are independent. 


Now, P [A, A A 0 Ag] = probability that all the three digits are each 
1=0 +P [A] P [Ay] P [A]. Hence Aj, A, A are not mutually independent. 
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7. There are two identical boxes containing respectively 4 white and 3 red balls, 3 
white and 7 red balls. A box is chosen at random and a ball is drawn from it. Find 
the probability that the ball is white. If the ball is white. What is the probability 
that it is from first box? 


| Hint: E = event that it is first box > P [] -5 


E = event that it is a second box => P [Fy | -5 


E = event that the ball is white 
meee es 
E 7 E 10 


ü) P[E]= pg? |Z] + r |Z | 


ab IG F (5 3] lag) 


(ii) By Baye's theorem, we have 


( 
pa) ert JE 140 _ 40 


P [E] x5 6l 6l 


8. Suppose 5 men out of 100 and 25 women out of 10,000 are colour blind. A colour 
blind person is chosen at random. What is the probability of his being male? 
(Assume male and female to be in equal numbers). 


[Hint: Æ = Person is a male, E = Person is a female, E = Person is colour blind. 


Then, PiBl=P (Bij. rior Eloo 
2 |E Ey 


Find P =| by Baye's theorem. ] 


Answers 


> [Paon PIA}, PIA} +P IB), PAUB] | 4. 


| Ay, Ag, A3 are not mutually independent. -eE 140 l 2 


uun 
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Statistical Quality Control Methods 


8.1 Quality 
Quality is a measure of how closely an item conforms to standards. The standards may 
relate to time, materials, performance, reliability, appearance of the product. 
As such in general, we may say that a product is considered to be good quality if it gives 
the desired services satisfactorily. The quality consisting the following aspects. 


Customer's preference i.e. the degree or level to which it is preferred by the 
customers over similar products from other sources. 


25 Degree of general excellence of the product as compared to other products of the 
same class. 

3. Quality characteristic i.e. its appearance, life, taste, odour, reliability, etc. 

4. Fitness for the purpose for which it is meant. 


8.2 Method for the Measured the Quality 


There are two methods to measure the quality of a manufactured product: 


l. The Numerical Measurement: We decide about the quality of a product by 
making numerical measurements on it, such as length, height, radius, density, 
area, volume etc., and the items which are meeting with the specification are 
accepted. 


2. Attributal Measurement: When any item is divided into two defective or 
effective classes then attraibutal measurement is applicable. 
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8.3 Quality Control 


The Control of quality involves, measurement, feedback, comparison with standards and 
correctness when necessary. 


Thus, the basic characteristic of quality control are: 
Je To insure that the desired specifications of the product are achieved. 


2. To Compare the product with standards and correct it when necessary. 


8.4 Statistical Quality Control (S.Q.C.) 

[B.C.A. (Lucknow) 2007, 2004; B.C.A. (Meerut) 2006, 2004, 2002; 
B.C.A. (Kanpur) 2005; B.C.A. (Agra) 2003] 
A quality control system performs inspection, testing and analysis to ensure that the 
quality of the products produced is as per the laid down quality standards. It is called 
“Statistical Quality Control” when statistical techniques are information system to 
satisfy the concept of prevention and control and improving upon product quality 
requires statistical thinking. In other words, the main purpose of statistical quality 
control is to device statistical methods for separating the variation of random nature 
from the other type, so that we may take appropriate action as quickly as possible when 

preventable causes are present in the system. 


Statistical quality control (S.Q.C) is systematic as compared to guess-work of haphazard 
process inspection and the mathematical statistical approach neutralizes personal basis 
and uncovers poor judgement (S.Q.C) consists of three general activities. 


i; Systematic collection and graphic recording of accurate data. 
2; Analyzing the data 


X: Practical engineering or management action, if the information obtained indicates 
significant deviations from the specified limits. 


Modern techniques of statistical quality control and acceptance sampling having an 
important part to play in the improvement of quality, enhancement of productivity, 
creation of consumer confidence and development of industrial economy of the country. 


The following statistical tools are generally used for the above purposes: 


8.4.1 Frequency Distribution 


Frequency distribution is a tabulation of the number of times a given quality characteristic 
occurs within the samples graphic representation of frequency distribution will show: 


i) Average quality 
ii) Spread of quality 
iii) Comparison with specific requirements 


iv) Process capability. 
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8.4.2 Control Chart 


Control Chart is a graphical representation of quality characteristics, which indicates 
whether the process is under control or not. 


8.4.3 Acceptance Sampling 
The acceptance sampling is the process of evaluating a portion of the product or material 


in a lot for the purpose of accepting or rejecting the lot on the basis of conforming to a 
quality specification. 


8.4.4 Analysis of Data 


It includes analysis of tolerances, correlation, analysis of variance, analysis for 
engineering design, problem solving technique to estimate cause to troubles. Statistical 
methods can be used in arriving at proper specification limits of product, in designing the 
product, in purchase of raw-material, semi-finished and finished products, 
manufacturing processes, inspection, packaging, sales and also after sales service. 


8.5 Advantages of Statistical Quality Control 
[Kanpur B.C.A. 2004, 2005; B.C.A. (Lucknow) 2007, 2008; B.C.A. (Agra) 2006; 
B.C.A. (Meerut), 2001, 2003; B.C.A. (Rohtak) 2008, 2009, 2011] 
l. Efficiency: The use of statistical quality control ensures rapid and efficient 
inspection at a minimum cost. It eliminates the need of 100% inspection of 


finished products because the acceptance sampling is statistical quality control 
exerts more effective pressure for quality improvement than it. 


2: Increases out-put and reduces wasted machine and man hours. 


3: It uncovers the cause of excessive variability in manufactured products fore costing 
trouble before rejections occur and reducing the amount of spoiled work. 


4. Points out when and where 100% inspection, sorting or screening is required. 


Easy Detection of Faults: In S.Q.C. after plotting the control charts (X, R, P.G, 
U) etc., when the points fall above the upper control limits or below the lower 
control limit, it gives an indication of deterioration in quality, we can take 
immediately correct action. 


6. Elimination of bottlenecks in the process of manufacturing. 


7 Provides a common language that may be used by all three groups designers, 
production personnel and inspectors in arriving at a rational solution of mutual 
problems. 


8. Adherance of Specifications. 
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8.6 Reasons for Variations in the Quality of a Product 


Two extremely similar things are rarely obtain in nature. This fact hold good for 
production processes as well no production process is good enough to produce all items 
of products exactly alike. Variation in their quality is bound to occur. If we analyse this 
variation, we observe that there are certain causes responsible for this variation. The 
variations are due to two main reasons. 


8.6.1 Chance or Random Cause or Non~-Assignable Causes 


The Variations due to chance causes are inevitable in any process or product. They are 
difficult to trace and to control also even under best conditions of production. This type 
of variation is unavoidable and no reason is possible to assign for such variation. We can 
say that this variation occurs in a random way. This type of variation can not be 
prevented and can not be detected and therefore is called allowable or tolerable and does 
not affect the quality and utility of the product. 


If the variations are due to chance factors alone, the observations will follow a “‘normal 
curve” the knowledge of the behaviour of chance variation in the foundation on which 
control chart analysis rests. The conditions which produce these variations are 
accordingly said to be “under control”. 


If the variations in the data do not conform to a pattern that might reasonably by 
produced by chance causes, then in this case, conditions, producing the variations are 
said to be “‘out of control”. 


8.6.2 Assignable Cause or Preventable Causes 


When the cause of variation in quality are such that they can be detected , assigned to 
some reasons and rectified to prevent further occurrence of such variability, they are 
known as assignable or preventable causes. The variations due to assignable cause possess 
greater magnitude as compare to those due to chance causes and can be easily traced or 
detected. The power of the shewhart control chart lies in its ability to separate out these 
assignable causes of quality variations e.g. in length thickness, weight or diameter of a 
component. 


The Variations due to assignable causes may be because of following factors: 
(i) Differences among machines 


(ii) | Differences among workers 


(iii) Differences in their relationship to one another. There variations may also be 
caused due to change in working conditions, mistake on the part of operator etc. 
Thus affects the quality and utility of the product and cause worry to the 
manufacture. 
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8.7 Techniques of Statistical Quality Control 
[B.C.A. (Meerut) 2003, 2005; B.C.A. (Lucknow) 2004, 2008] 


In statistical quality control our aim is to bring out the assignable causes of variation as 
soon as they occur, so that the actual variation may be supposed to be due to chance 
causes alone. 


To control the quality characteristics of the product, there are two main techniques. 


8.7.1 Process Control 


It is the process of monitoring and measuring variability in the performance of a process 
or a machine through the interpretation of statistical techniques and it is employed to 
manage in-process quality. This technique ensures the production of requisite standard 
product and make use of control charts. In this type of problem our aim is to control the 
manufacturing process so that the proportion of defective items is not large. 


8.7.2 Product Control 


This technique is concerned with inspection of already produced goods to ascertain 
whether they are fit to be despatched or not. To achieve the objectives, it makes use of 
sampling inspection plans. In this type of problem, we like to ensure that lots 
manufactured goods do not contain an excessively large proportion of defective items, 
Process control is achieved mainly through the technique of control charts, whereas 
product control is achieved through sampling plans. This process can be understand by 
following chart: 


R-chart 


o-chart 


Process Control 


Attributes 


Product Control 


Fig. 1 
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8.7.3 Control Chart 


A control chart is a graphical representation of the collected information that remove the 
variation in processing and warns if there is any departure from the specified tolerance 
limits. We can say control chart is a device for attaining quality control or is a device to 


judge whether the statistical control has been attained. 


8.7.4 Control Limits 


The control limits on the chart are so placed as to disclose the presence or absence of the 
assignable causes of quality variation which make the diagenosis possible and brings 
substantial improvements in the product quality and reduction of spailage and rework. 
Control chart tells when to leave the process alone and thus prevents unnessarily 
frequent adjustments that tend to increase the variability of the process rather than to 
decrease it. A control chart is simply a frequency distribution of the observed value 
plotted as points in order of occurrence so that each value has its own identity relative to 


the time of observation. The points on the control charts may or may not be connected. 


In control charts, the quality characteristic (Q) of manufactured product follows a 
distribution only when random causes are present in the process. This distribution has 
the mean Q and standard deviation SQ. If this distribution is normal the limits Q + 309g 
consists of 99.73 points of the distribution. The limits Q+ 20Q and Q + OQ consists of 
95.44% and 68.26% points of the distribution respectively. If the distribution of Q is not 
normal, the limits Q ad 30Q consists 88.89% points of the distribution. Therefore, in 
every case, these limits contain all the points of distribution and if any point is beyond 
these limits, we find that not only random causes are present in the process but the 
assignable causes are also present. In other words, we can say that the process is out of 
control. If there is no point outside the limit, then the process is under control. From 


above, we conclude that in SQC we don’t control the quality but we control the process. 


By considering these data, we state earlier a central line Q, upper control limit Q+ 30Q 
and lower control limit Q - 369g are drawn as horizontal straight lines on a graph paper 
and the number of samples known as homogenous sub-groups taken from different shifts 
of the process are plotted on the same graph paper. The sub-group number is taken on 
X-axis and quality characteristic Q is taken on Y-axis. The graph, therefore obtained is 


known as control chart. An example of control chart is shown on the next page. 


There are many types of control charts designed for different control situations. Most 
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Q+ 30Q > a oe a ea a Mee UCL: 


Control line 


O l 2 3 4 k x 


Fig. 2: Sub-group number 


commonly used control charts are: 

(i) | Control Charts for Variables: They are useful to measure quality characteristics 
and to control automatic process. It includes X and R-charts and charts for X and o. 

(ii) Control Charts for Attributes: It includes P-chart for fraction defective. A 


E 


fraction defective control chart discloses erratic fluctuations in the quality of 
inspection which may result in improvement in inspection practice and inspection 


standards. 


It also includes C-chart for number of defects per unit. 


8.8 Object of Control Charts 


shows the variation in the average of samples. 


It is the most commonly used variables chart. R-chart shows the uniformity or 
consistency of the process. It is the chart for measure of spread. o-chart shows the 


variation of process. 


procedures. 


or acceptance procedure or both. 


a fundamental change in the production line or to tell whether the process is in 


control and it so, at what dispersion. 


X and R or X and o charts are used in combination of control process. X-chart 


To secure information to be used in establishing or changing production 
To secure information to be used in establishing or changing inspection procedure 


To determine whether a given process can meet the existing specifications without 
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8.9 Type of Control Charts 


Control Charts is divide into two groups: 


l. 


Control Charts for Variables 


When quality is measured through numerical measurement, control charts are 
called control charts for variables. For example measurements of resistance of wire, 
density of liquid, volume of solid body are of continuous type and follow the 
normal probability law. For quality control of such data, the following control 
charts are used: 

(i) Control chart for mean (X) 

(ii) Control chart for S.D. (o-chart) 

(iii) Control chart for range (R-chart) 

Control Charts for Attributes 


If quality characteristic is on attribute, and each item is regarded as either defective 
or non-defective. 


There are three types of control charts in practical. 
(i) | Control charts for function defective (p-chart) 
(ii) | Control charts for number of defective (np-chart) 


(iii) Control charts for number of defective (C-chart) 


8.9.1 Construction of Control Charts for Variables 


First of all, a random sample of size n is taken during a manufacturing process over a 


period of time and quality measurements x1, 79 ....,%, are noted. 


n 
= A Fi +...44% ] 
Sample mean = 7 = ts Te 2 Xj 
n n 


Sample range = R = Xmax ~ ¥min 


If the process is found stable, k consecutive samples are selected and for each sample, ¥ 


and R are calculated. 


Then find mean (F) and sample range (R) as follow: 


and 


This ¥ is called control line (C.L.) 
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For X-chart 
: X, when tolerance limits are not given 
Control line = ` a ee 
u, when toterence limit are given 

where u= 5 [L.C.L + U.C.L] 

L.C.L. = lower control limit 

U.C.L = upper control limit 
Now, L.C.L. (for X -chart) = ¥ - Ay R 
and U.C.L. (for X-chart) = ¥ + Ay R 


Ay, depends on sample size n and can be found from the following table: 


Sample p = 7 s 3 
S 2 3 4 5 6 7 9 10 | 11 12 13 14 15 16 | 17 | 18 19 20 
Size (n) 


For R-chart: Control line (C.L.) = R 
and L.C.L. (for R-chart) = D3 R 
U.C.L. (for R-chart) = D4R 


Where Ds and D4 depend on sample size are found from the following table: 
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To Compute upper and lower process tolerance limits for the value of x, we find 


2 UTL=7+% 
dy dy 


GO 


L.T.L=x- 


Where dy is found from the above table. 


The process capability is given by 60 =6 ~ 
2 


Where o is standard deviation. While plotting the X-chart the control line on the 
X-chart should be drawn as a solid horizontal line at X. The upper and lower control 
limits for X-chart should be drawn as dotted horizontal lines at the computed values. 


Similarly, for R-chart, the control line should be drawn as a solid horizontal line at R. 
The upper control limit should be drawn as dotted horizontal line at the computed value 
of UCLpR R 


If sub-group size is 7 or more, the lower control limit should be drawn as dotted 
horizontal line at LCLr . However, if the sub-group size is <6, the lower control limit for 
R is zero. 


Conclusion: If the observed values fall within the control limits, the process is assumed 
to be in control at that moment of production. If the observed values fall out side the 
control limits, the process is said to be out of control at that moment and presence of 
assignable causes is indicated in the process. However, points which fall outside the 
control limits do not necessarily represent rejected material but only signas that some 
corrective action is required to prevent manufacturing faculty parts. 


8.10 Control Charts for Attributes 
[B.C.A. (Meerut) 2005, 2003] 


Following control charts will be discussed here: 


1. P-chart 
2. np-chart 
3. C-chart 
4. u-chart 


As an alternative to X and R-chart and as a substitute when characteristc is measured 
only by attribute, a control chart based on fraction defective p is used, called P-chart. 


B Number of defective articles found in any inspection 
Total number of articles actually inspected 
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Control Limits (30 limits) on p—Chart 


We know that for binomial distribution, the mean value of total number of 
defectives in a sample n is np and standard deviation is Jnpq ; 


where ptq=l or g=I-p. 
Mean value of fraction defective is p and S.D. 


np(l-p)  JpU-p) 


P n n 
C.L.= p 


The upper and lower limits for P-chart are: 


(l= p) 
U.C.Ly =p +30, =p+3 Jo 


(l= p) 
and LC.Ly =p-30, =p-3 = 


Control Limits for np-Chart 


Whenever sub-group size is variable, P-chart is used but if it is constant the chart 
for actual number defectives called np called np chart is used 


C.L.=np where p= znj 
Èn 
U.C.L.np = np +30np =np +3 Vp l- p) 


where Onp =NOy 
and L.C.Liny = mp — 3,/np (1—p) 


Control Limits for C-Chart 
(i) Difference between a defect and defective. 


(ii) An item is called defective if it fails to conform to the specifications in any of 
the characteristics. Each characteristic that does not meet the specifications 
is a defect. An item is defective if it contains atleast one defect. The np chart 
applies to the number of defectives in sub-groups of constant size while 
C-chart applies to the number of defects in a sub-group of constant size. 


(iti) Basis for control limits on C-chart: Control limits on C-chart are based on 
Poisson distribution. 


(iv) Calculation of control limits on C-chart: Standard deviation o, = Jc 


~m 
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Thus 36 limits on a C-chart are: 


U.C.Le =C +347 
LCL- -3/7 


and control line C.L.= C 


where T- number of defects in all samples 


total number of samples 


u-chart: When the sub-group size varies from sample to sample, it is necessary to 
use u-charts. The control limits on u-chart will however vary. If C is total number 
of defects found in any sample and n is number of inspection units in a sample. 


— c number of defective in a sample 
ioo āe SSAI 
n number of units in a sample 


The control limits on u-chart are 


U.C.L =m +3,|*; LCi, =7 -3# 
n n 


and control line CL. =x 


There are two ways of obtaining these limits. One way is to use the value of ranges 
and another way is to use the values of standard deviations. 


On the Basis of R (Range) On the Basis of s (S.D) 


From the sampling distribution of range If the control limits are to be found 
on the basis of s except R, then the 
E(R) = dy Sp estimate of o,can be found from 


Where dy is a constant depending upon te 
the sample size. Thus an estimate of Op 


can be found from R by 
Op = R /dy 


Also ¥ gives an unbiased estimate of the 
population mean Hp- 


k 
=l 


E(¥2;) 


E) = ——— =n, 
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Example 1: Adriling machine bores hales with a mean diameter 0.5230 cm and a standard 
deviation of 0.0032 cm. calculate the 2-sigma and 3-sigma upper and lower control limits for means 


of sample of 4. 
[B.C.A. (Lucknow) 2008, 2006; B.C.A. (Rohtak) 2008; 
B.C.A. (Kanpur) 2006; B.C.A. (Meerut) 2005; B.C.A. (Agra) 2004] 


Solution: Mean diameter ¥ =0.5230 cm 
S.D. (o) =0.0032 cm 
n=4 
(G)  2-sigma limits are: 


C.L. = ï =0.5230 cm 


U.C.L.=74+ 2- =05230 +2 x OM -0.5262 cm 
vn V4 
iat 29 ©5950 -2 OO -0.5198cm 
vn V4 

(ii) | 3-sigma limits are: 

C.L. = ¥ =0.5230 cm 
U.C.L.= 7+3 -L = 0523043 x ore ~0.5278em 
vn V4 

2 0.0032 
L.C.L.= 7-3 -Z =0.5230 -3x =0.5182 cm 
vn V4 


Example 2: In the production of certain rods, a process is said to be in control is the outside 
diameters have a mean 2.5" and a s.d. of 0.002". 


(i) Find the control limits for the mean of random samples of size 4. 


(ii) Mean of 10 random samples taken at regular interval were: 2.5014, 2.5022, 2.4995, 
2.5076, 2.504, 2.4993, 2.4962, 2.4966 and 2.497 1, was the process even out of control. 


Solution: Given that ¥=2.5",0=0.002" n=4 


(i) Control limits for X-chart 


The control line (C.L.) = ¥ + ay =2.5+ pa 
vn V4 
=2.5+0.003 =2.503" 
LCL. =7-32 295-3 200% 2 5_6 0009 =2.497" 


V4 V4 


Cso) Elements of Statistics 


(ii) On the basis of control line U.C.L and L.C.L. and plotting the value of 10 means, it 
is found that process is out of control corresponding to sample number 4,5,8 and 9 


2.5010 
2.5008 
2.5006 


2.5004 
2.503 
2.5002 


2.500 


2.498 
2.497 
2.496 


2.494 
2.492 


I 2 3 4 5 6 7 8 9 10 


Fig. 3: Sample number 
Mean chart showing process out of control. 


Example 3: The following are the mean lengths and ranges of lengths of a finished product from 10 
samples each of size 5. The specification limits for length are 200 + 5 cm construct X and R-chart 


and examine whether the process is under control and state your recommendations. 
smi [2 e e e E 


eo eee dedede 


Assume for n=5, Ay =0.58, Dg =2.1 land D3 =0 


[B.C.A. (Lucknow) 2010, 2006, 2002] 


Solution: 
(i) Control limits for X chart: 
Control limits (C.L.) =200 
Tolerance or specification limits are given: 
u =200 
U.G.L.; =%+ A&R =p+ AR 
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Ri + Ro +...+R 


L.C.Ly =A 2 r T 0 
10 10 
Then, U.C.L.z =200 + (0.58x4.6) = 202.668 


L.C.L.z =200 - (0.58x4.6) =197.332 


(ii) Control limits for R-chart: 
Control limit (C.L.) =R =4.6 
U.C.L.g = D4R =2.11x4.6 =9.706 


L.C.L.p = D3 R =0x4.6=0 


The X and R-charts are drawn below: 


~------------- U.C.L. = 202.668 


Sample mean 200 C.L. = 200 


number (X) 
Sse ues L.C.L. = 197.332 


195 


190 


l 2 3 4 5 6 T 8 9 10 
Sample numbers 


Fig. 4: X-chart 


12 U.C.L. = 9.706 


Sample range (R) > 


l 2 3 4 5 6 7 8 9 10 L.C.L. = 0 


Sample numbers > 


Fig. 5: R-chart 
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Example 4: Construct a control chart for mean and range for the following data on the basis of 
fuses, sample of 5 being taken every hour (each set of 5 has been arranged in ascending order of 
magnitude). Comment on whether the production seems to be under control. Assuming that these are 
the first data: 


ee e e e e e ee 
rs os Pe [oe [or | oe [oo [me [om few om Pe 
rs fo [oo fo [ar [os [= [a |» [ro pe 


Pe fafa Pr [|e [so fo few fio 


Solution: 


Caffe ro [oe [a 


fal ofa fs [oes [| 
o feefee e e e 
O feefee e [mf « [3 
o fefoje pe e [se [ima |e 
C fefoje fiw ne [ae e e 
ea 8 


Total 


From the above data we obtain 
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M> 


Control limits for X-Chart 


U.C.L. = X+ Ag R = 72.1+0.58x58 = 105.74 


where Ag =0.58 at n=5 


L.C.L.= X -Ax R = 72.1-0.58x58 = 38.46 


CLX- 
100 


80 C.L. = 72.1 


Sample number 
lon 
ó 


l 2 3 4 5 6 7 8 9 10 I1 12 
Sample number 


Fig. 6 


For R-chart 


From the about chart it is clear that process is out of control corresponding to the 8th and 
10th samples. 
U.C.L. =D4R = 2.11x58 = 122.38 
(The value of D4 for n = 5 is 2.11) 
L.C.L.=D3 R =0x59.67=0 


G.L:=58 


—— 


C354) 


130 
120 
110 
100 


I 23 4 5 


6 7 
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C.L. = 58 


8 9 10 11 12 


Fig. 7 


Since all the sample points fall within the control limits the process is within control. 


Example 5: 20 random samples of 5 units drawn from each lot of wire gave the following mean 
diameters and ranges. Find out control limits for X and R-chart and comment. 


Sub-group 


a 
ae 
es 
[e] 


e p o e 
Pea] 
D aaa 


o e 
a j 
Lo a 
eo o 
64.8 
3 
i 
as 
ec 


Solution: We have 20 samples and size of each is 4 


KA k 
Tet 
k 
= AR 
and R= 


k 


=20,n=4, 5X =1263.4, ER =215 


1263.4 


=63.17 


al — =10.75 


20 


—_ 
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Control limits for X-chart 
Control line (X) =63.17 
U.CL=X+AgR =63.17+0.58x10.75 
(where n=5, and Ay =0.58) 

U.C.L. = 63 .17+ 6.235 =69.5 

L.C.L.= X —AgR =63.17-0.58%10.75 
=63.17-6.235 

5 

Where 1 =5 and D3 =0 from table. 


From the value of X given above, it is found that the process is out of control 
corresponding to sample number 17 and from the value of R given above, it is found that 
the process within the control. 


Example 6: In a blade manufacturing factory, 1,000 blades are examined daily. Draw the 
np-chart for the following table and examine whether the process is under control ? 


me PP PEEP E E 


[B.C.A. (Kurukshetra) 2011, 2007; B.C.A. (Lucknow) 2008, 2006, 2005; 
B.C.A. (Kanpur) 2004; B.C.A. (Meerut) 2004, 2002] 


Solution: We have n=1,000 


Xnp = total number of defectives = 166 


Xn = total number inspected = 1,000 x15 


Znp_ — 166 0.011 


np =1,000x0.011=11 
Cotrol limits are CL.=ap=11 


U.C. Lap = np + 3 np 0- P) =11+3 110-0011) =20.894 


L.C. Lap = np -3 4mp -p =11-3 J110-0.01]) =1.106 


The np-chart is drawn in the figure and we see that all the points lie within the control 
limits and process is under control. 


Elements of Statistics 


25 

92.894 

OOL °° °° OC ae 

2 U.C.L = 20.894 
3 15 
g 
$ 11 CL=11 
z 6 
$ 
2 5 LCL = 1.106 
5 | 1.106 | -------2---2--nnnnnnnnnnnnnncennnnnss 


1 2 3 4 5 6 7 8 9 10 Il 12 13 14 15 
Sample number 


Fig. 8: np-chart 


Example 7: If the average function defective of a large sample of a product is 0.1537, calculate the 
control limits given that sub-group size is 2,000. 


[B.C.A. (A.M.U.) 2011, 2007, 2005; B.C.A. (Meerut) 2008, 2004] 


Solution: Average fraction defective p =0.1537 
sub-group size is 2,000 i.e. n = 2,000 
control line C.L. = np = 2,000 x0.1537 =307.4 
U.C.Lijy = np + 30yy = np + 3 pipl -P 
=307.4 +3 4307.4 (1-0.1537) =307.4 + 48.38774204 
=355 .787742 
and L.C.Liyy =p -3 J np (1 P) =307.4 -48.38774204 


=259 .012258 


Example 8: A control chart for defects per unit w uses probability limits corresponding to 
probabilities of 0.975 and 0.025. The control line on the control chart is at u =2.0. The limits 
vary with the the value of n. Determine the correct position of these upper and lower control limits 
when n=5 (assume © =1.96) 


Solution: U.C.L, =a + aj? =2+ 1.96] = 3.299 
n 


and LCL, =2-1.96 |? =0.761 


ea 
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Example 9: In a manufacturing process, the number of detectives found in the inspection of 20 lots 
of 100 samples is given below: 


(i) Determine the control limits of p-chart and state whether the process is in control. 


(ii) Determine the new value of mean fraction defective if some points are out of control. Computer 
the corresponding control limits and state whether the process is still in control or not. 


(iii) Determine the sample size when a quality limit not worse than 9% is desirable and a 10% 
bad product will not be permitted more then three times in thousand. 


Solution: 
(i) We have A total number gt detectives _ 120 -0.06 
total number of items inspected 20x100 
Uige | P 2p 965) OO e200) 
? n 100 
=0.13095 
p(l -p 0.06 (1-0 .06 
Lely pss] P pgg] UO aps 
F n 100 


Since the defective can not be negative then L.C.L., = 0 after observing the values 
of defectives in the given examples, it is clear that only 8th lot having fraction 


defective oe =0.15 will go above U.C.L. 
100 P 


120-15 


(ii) After eliminating the 8th lot revised value for p =———— = 0.056 
100 x19 
The revised control limits will be 
U.C.L,, =0.056 +3 CA E =0.125 


100 
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(iii) 


L.C.L, =0.056 -3 error 000) =-0.013 =0 
P 100 


It is clear that all the points are within control limits 


revised quality level p =0.056 


Since a probability that a defective worse than 9% defective quality will not be 
permitted is more than 3 times in thousand (0.3%) is corresponding 36 limits 


p+3p=0.09 


m 0.056 CIEE 
n 


0.056 x0.944 0.034 


or —— e 
n 3 
2 
0.056x0.944 (0.034 
mee ea (=) = (0.01133)? 
n 3 
0.056 x0.944 
=> n -o 855, 


~ 0.01133 x0.01133 


8.11 Control Charts for Variables Versus Control Charts 


for Attributes 


Variable Charts Attribute Chart 
1. Separate charts are constructed for|1. A single chart is considered sufficient 
each quality characteristics. to serve the purpose. 
2. The data is collected purposively for | 2. Data collected for other purposes may 
the use of variable charts. also be used. 
3. Cost of collecting information on |3. Cost of collecting information on 
units is large. units is very small. 


4. As the control charts are sensitive to | 4. These charts are less sensitive to 


Ta 


assignable causes, these charts assignable causes, these charts are not 


provides better quality control. much effective in quality control. 


5. Usually a sample of size 4 or 5 serve | 5. Usually a large sample size works for 
the purpose. correct conclusions. 


6. 


In general, sub-group size is taken to | 6. The need for equal size of samples is 
be equal. not much required as in case of 
variables. 


Computational procedure is lengthy | 7. Computational procedure is easy and 
and time consuming. simple. 


—_ 
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8.12 Advantage of Statistical Quality Control 


We know that by S.Q.C, the quality of a certain product is improved by bringing the 

process under control. The following are some advantages of S.Q.C. 

1. By correcting (repairing) the plaint at an appropriate time, a considerable amount 
of cost per unit is reduced. 

2. A considerable amount of excess raw-material is avoided. 

The inspection of each and every unit is not practicable and reduce the efficiency 
of the quality inspectors. S.Q.C. keeps the workers alert. 

4. S.Q.C. beeps continuous vigilance on the quality of the product. As soon as the 
presence of assignable causes is feit the concerned engineer is informed and the 
related part is corrected. In this way, the quality is improved and probability of the 
company increases. 


5. The number of defective items become less and amount of scrap reduces. 

6. S.Q.C. is economical in the sense that only a fraction of items produced is 
inspected. 

I It provides protection against losses to the producer as well as to consumer. 


8.13 Limitation of Statistical Quality Control 


A number of advantages are discussed above and the usefulness of S.Q.C. is focused. But 
S.Q.C. is not only the single technique for controlling all type of evils in production. A 
part from S.Q.C. many more consideration are made in the production of units. We can 
say that S.Q.C. is one of the many considerations which are made in the production 
system. That is S.Q.C. is a part of the production process rather than the production 
being a pat of S.Q.C. 


Control limits for X-chart 


On the Basis of R On the Basis of 5 


After obtaining the upper and lower control limits and all the values of statistics, 
sub-groups (samples) numbers are taken on x-axis and the value of statistics on y-axis. 


All the points thus plotted are connected and chart is found. If all the points lie within 
the control limits, we can say that the process is under control and if any point goes 
outside the control limits, we can say that the process is out of control. 


And some assignable causes are present in the system that must be corrected. 


C) Elements of Statistics 


8.14 Control Chart for Number of Defects per Unit 
(C-Chart) 


If c; is the number of defects observed in i-th unit, then then estimate of À is given by ĉ as 


it is an unbiased estimate of À i.e. E(c),=A. 


The value of ¢ is calculated by the following formula: 


-_  totalnumber of defects c} + c3 +... + Ch 
TO ee 
number of units observed k 


Thus, 3-sigma are given by: 
U.C.L. = + 34/7 
L.C.L. =ē -34/7 
Control line =¢ 


If L.C.L. comes out to be negative, it is considered as zero as the number of defects can 


not be negative. 
Example 10: The number of defects in 24 rolls of cloth each of 100 meters length is given by: 
3,5,8,9,4,2,5,9,6,4,8,12,7,5,10,10,7,5,4,10,9,5,8,13 


Draw a suitable control chart and give your comments. 
[B.C.A. (Kanpur) 2006, 2002; B.C.A. (Meerut) 2005] 


Solution: Ifc; denotes the number of defects in the i-th group, then we have the average 


number of defects 


34+54+84+94...494548413 _ 168 _. 


C= =— = 
24 24 


control limit (¢) =7 


3-sigma control limits for C-chart 


U.C.L.=7 +3 [@=74+3 [7=7+3x2.645 =14.937 


L.C.L.=7 -3 [z =7 -3 [7=7-3x2.645 =0.937 =0 


p 
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14 C-chart 


13 
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O j= 
e 
. 
. 
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1 3 5 7 9 M 13 15 17 19 21 23 25 Le 


Fig. 9 
Since all the points are with in the control limits, the process is in a state of statistical 


control. 


Example 11: Twenty five boxes, each containing 20 electric switches were randomly selected and 
inspected for the number of defectives in each box. The number of defectives found in each box were as 
follows: 


Ranier [22] *]s]o]7 [a> [ola alay 
na fee [2 |2[1fol«[2}r[2}sfol2fi [2 


Banmer [afis [rs iz [isis [a0] a [222s [2 [as] _ 
me foes [ols [sle eO ie beO 


Set up control limits for a number of defectives. 
[ B.C.A. (Rohtak) 2011; B.C.A. (A.M.U.) 2010, 2006; B.C.A. (Agra) 2006] 
k 
Solution: Total number of defectives = 5 d; =48 
i=l 


di 
i=l _ 48 

nk 20x25 
q =1-p=1-0.096 =0.904 


np =20 x0.096 =1.92 


=0.096 


p= 
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Control limits for number of defectives are: 


Control line = np =1.92 


U.C.L.= np +3 npg =1.92 + 31.92 x0.904 =1.92 +3 J1.73568 


=1.92+3 «1.31745 =1.92 +3.95 =5.87 
L.C.L=np -3,Jnpq =1.92 -3,/1.92 x0.904 =1.92 -3,/1.73568 


=1.92 -3 x1.31745 =1.92 -3.95 =-2.03 =0 


Since the number of defectives can never be negative, the lower control limit is taken 


zero. The control chart for number of defectives drawn below: 


Example 12: The following are the figures of defectives in 22 lots each containing 2,000 rubbers 
belts: 


425, 430, 216, 341, 225, 322, 280, 306, 337, 305, 356, 402, 216, 264, 126, 409, 193, 
326, 280, 389, 451, 420 


Draw control charts for fraction defective and comment. [B.C.A. (Meerut) 2005, 2003, 2002] 


Solution: Here n=2,000 for each lot. If d; and p; are the number of defectives and 


sample fraction defective for ith lot respectively, then 


= (i=1,2,3...,22) 


Pi = 7000 


These can be shown in following table: 
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M= 


izi _ 3.5095 


Thus, p=- =0.1595 


g =1-0.1595 =0.8405 


Control line (C.L.) = p =0.1595 


Uciepes i se | ae an 
n 2,000 


=0.1595 + 3 0.000067 =0.1595 +0.0246 =0.1841 


it 7 A AE E ee 
n 2,000 


=0.1595 -3 0.000067 =0.1595 -0.0246 =0.1349 


p-chart for fixed sample size. 


230 
220 
210 
200 
190 
180 
170 
160 
150 
140 
130 
120 


2 4 6 8 10 12 14 16 18 20 22 24 
Fig. 10 


From the chart it is found that a number of points fall outside the control limits, hence 
the process is out of control. 
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Example 13: Given control limits to control the variability in production when o is not known. 


Use the following values to study the variation in the production of screws when D3 =0 and 
D4 =2.282 for n=4. 


Range (m.m) 


~ lw, 28 
Solution: We have R =-= YR =—=3.5 
k 4 


Therefore, control limit for R-chart 
WeG.L. =D4R =2.282 x3.5 =7.987 


L.C.L. = D} R 30x35 =0 


C.L.=R =35 


Example 14: In the following data draw X-chart and also given your comments: 


Sample Numbers Sample Observations 


ete [«[se[a 


ee | 
a 
a a) 


Given for: n=4, Ay =0.58, n=8, Ag =0.373. 
[B.C.A. (Lucknow) 2009, 2007; B.C.A. (Kanpur) 2007, 2004, 2002; 


B.C.A. (Agra) 2005, 2003] 


—— 
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Solution: 


Sample Numbers FT Observations Mean (X) 


os [Toofos [raf sr [ora fn 
ofe fe Por pe a 
eee 652 [7 


E E E 
P= ovat 515.2 [ts 


Control limits for X-chart 
U.C.L.= X + Ay R = 64.15 + 0.58 x18.5 =74.88 


L.C.L. = X - Ay R =64.15 -0.58x18.5 =53.42 


C.L. = X -64.45 
Y 110 


100 
90 


U.C.L. = 74.88 


Sample mean (X) 


C.L. = 64.15 
L.C.L = 53.42 


1 2 3 4 5 6 7 8 9 10 11 12 13 
Sample number 


Fig. 11 
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Example 15: The average number of defectives in 22-sampled lots of 2,000 rubber belts each was 
found to be 16%. Indicate how to construct the relevant control limits. 


[B.C.A. (Kashi Vidhyapeeth) 2011, 2009, 2007] 


Solution: Number of lots k = 22 
Number of rubber belts inspected per lot = 2,000 


Average number of defectives per lot = 16% 
Therefore, proportion of defective per lot = a =0.16 
i.e. p=0.16 


Control limits for p-chart 


Cons rra hgg AN 
n 2,000 


ia =0.16 +3 40.0000672 


? 


=0.16+3 x0.0082 =0.16+0.0246 =0.1846 


LCL =p-3 (PUP) 9 16-3 es 
n 2,000 
0.1344 
=0.16 -3 |-———— =0.16 -3 ,0.0000672 
2,000 


=0.16 -3 x0.0082 =0.16 -0.046 =0.1354 


iia 
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Exercise 


What do you understand by statistical quality control ? Discuss briefly the need 


and utility of control charts in industry. 
[B.C.A. (Meerut) 2004] 


What do you understand by statistical quality control ? Explain C-chart in detail 


with advantage. 
[B.C.A. (Meerut) 2002] 


Write short notes on 

(i) X-chart (ii) Application of C-chart 

(iii) Process and product control (iv) Chance causes and assignable causes 
[B.C.A. (Kanpur) 2007, 2004; B.C.A. (Meerut) 2006, 2002] 


What are main control charts for variables ? Obtain their control limits. 
[B.C.A. (Meerut) 2008, 2004] 


What are the main control charts for attributes ? Obtain their control limits. 
[B.C.A. (Lucknow) 2008; B.C.A. (A.M.U) 2006] 


Write down the formulae for limits of C-charts. 
Write the name of any two control charts. 
Name the distribution. 

What limitation of statistical quality control. 


A company manufactures screws to a normal diameter 0.500 40.030 cm. Five 
samples were taken randomly from the manufactured lotes and 3 measurements 
were taken on each sample at different lengths. Following are the readings: 


S. No. Measurement per Sample x (in cm) 


0.498 0.515 0.487 
0.492 0.509 0.514 
0.490 0.508 0.499 


Calculate the control limits of X and R-charts. Draw X and R-charts and examine 
whether the process is in statistical control. (Taking Ay =1.02, Dy =2.57,D3 =0 
forn=3) 
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11. The Number of customer complaints received daily by an organization is given 


below: 


o» EEEE EEE ee 


ewmm |= [eof fof folofe]e [ofr [ofa [+ 
Does it mean that the number of complaints is under statistical control ? Establish 


a control scheme for the future. 


12. The following data shows the value sample mean X and range R for 10 samples of 
size 5 each. Calculate the values for control line and control limits for X-chart and 
R-chart and determine whether the process is under control: 


=e PEE pe 
meando [102 nis [ios rafa foefies] 96 [roe 0 


none [7 f+ fetsirtsls|sl7 | 


Assume for n =5, A =0.577, D3 =0 and Dy =2.115 


Explain the construction of the control charts for variables. Give 3-o lower and 
upper limits for X-chart, where the sample size in n. The following number of 
defects were found on 20 items during an inspection schedule. calculate control 
limits draw a control chart and comment on your result. 


3,8,5,8,6,3,4,7,8,10,2,8,6,4,5,3,0,2,6,6 


14. A machine is set to deliver packets of a given weight. 10 samples of size 5 were 


recorded. Below are given the relevant data: 


SampleNo. | 1 | 2 | 3 | 4 [5 | 6] 7 | 8 [9 | 10 | 


means |s or fas [uw [av pe fae [as [7 iw 
o 1717 1«[ole[7[elsfuls 


Calculate the values for control line and control limits for mean chart and range 


chart and comment on the state of control [Conversion factors for n=5 are 
A =0.58, D3 =0, D4 =2.1 1]. 


15. The following shows that the number of missing rivets observed at the time of 
inspection of 12 air crafts. Find the control limits for the number of defects chart 
and comment on the state of control: 


wewn [p pE E e 
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16. 


17. 


18. 
19. 


20. 


21. 


22. 


The following data show the values of sample mean (X) and range (R) for 10 
samples of size 6 each. Calculate the values for control line and control limits for 


mean and range chart. Draw control charts and comment on the state of control: 


mel PPP pps 


wwe Lo |e e e e e e 


(Conversion factors for n =6 are Ay =0.483, D3 =0, Dy =2.004). 


The average percentage of defectives in 27 samples of size 15,00 each was found to 
be 13.7% construct p-chart for this situation. Explain how the control chart can be 
used to control quality 


[Hint: p =0.137] 
It was found that when a manufacturing process is under control, the average 


number of defectives per sample batch of 10 is 1.2. What would you set in a quality 
control chart based on the examination of defectives in sample batches of 10 ? 


What are statistical quality control techniques ? Discuss the objectives and 
advantages of statistical quality control. 


[B.C.A. (A.M.U.) 2010; B.C.A. (Lucknow) 2008, 2006, 2004] 


The following are the number of defects found in 1,000 items of same industry 
goods inspected everyday in a month. Draw C-chart: 


m EN e e 
anana PPP e e e 
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The number of defects in 20 items each of 100 meters length is given below: 


1,3,3,1,6,4,3,7,10,2,2,6,4,3,2,7,1,5,6,4 


Draw the appropriate chart and give comment on your result. 


A manufacturer finds that on an average | in 10 of the items produced by him is 
defective. A few days later he finds 20 items in a sample of 100 items defectives. 


In the process out of control. Gives reasons for your answer. 


(C.L. =10, U.C.L. =19, L.C.L. =1 Process out of control). 


C550 ) Elements of Statistics 


23. Ina capability study of a lathe used in turning a shaft to diameter of 23.75 +0.1 
mm, a sample of 6 consecutive pieces were taken each day for 8 days and data was 


recorded as given below: 


— TESTES Eves 


Mean (X) | 23.765 | 23.77 | 23.7716 | 23.7767 | 23.7717 | 23.7583] 23.7767 |23.767 
Range (R) 0.08 0.04 0.05 0.07 0.07 


Plot X and R-Chart and state whether the process is in control or not. Establish the 


process capability and compare it with the stipulated specifications. 


For a sample of size 6, take Ay =0.48, D4 =2,D3 =0, dy =2.534. 


An Swers 


C.L. g =0.4988, U.C.L.ş OS L.C.L.¢ =(0.4804, 
C.L.p =0.018,U.C.L.p =0.0463, L.C.L.p =0, The process is in control. 
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